proteingym-agent
GitHub ↗

DMS substitution leaderboard

Matched ProteinGym DMS substitution splits for black-box LLM rankers and specialized biomolecular baselines. Scores are nested-macro Spearman ρ; LLM assay scores are averaged across frozen seeds before aggregation. The Spearman SE column is the standard error across frozen split seeds. Click any column header to sort.