[New Model] PoET-2 for DMS Zero-Shot Benchmarks#89
[New Model] PoET-2 for DMS Zero-Shot Benchmarks#89timt51 wants to merge 2 commits intoOATML-Markslab:mainfrom
Conversation
|
Bonjour cher @pascalnotin, I was wondering if you had the time to look at this? Integrating this model would be useful. Thanks. |
|
I realized that I may have needed to create an accompanying issue, so here's that issue: #93. |
|
Thank you for the PR @timt51 ! |
|
Happy New Year! |
|
Hi @pascalnotin, The discrepancy was due to an issue with downloading structures from AFDB. The scoring script previously attempted to download structures from AFDBv4, but most of those links are now broken. I have updated the scoring script to download structures from the latest version of AFDB, AFDBv6. Deleting the following directories, if present, and then rerunning the scripts should reproduce the performance numbers reported in the PR description within ~0.001. DMS: I have also updated the PR description with links to the new scores produced by running the updated scoring script on the DMS benchmarks. |
|
@pascalnotin Bonjour, just wondering if you had had the time to look at this, thanks! |
This PR adds a new baseline model, PoET-2, for the DMS (substitutions and indels) and Clinical (substitutions and indels) unsupervised benchmarks.
⚙️ Setup
Download Model Weights & MSAs
To download the model weights and the Multiple Sequence Alignments (MSAs) required for predictions, run the following commands:
cd proteingym/baselines/PoET-2 make downloadThis will save the model weights to
~/.cache/ProteinGym/baselines/PoET-2and the MSAs to~/.cache/ProteinGym/baselines/PoET. Note that the MSAs are the same as those used for PoET(-1).🚀 Running Inference
The scoring scripts for each benchmark are detailed in the table below.
scripts/scoring_DMS_zero_shot/scoring_PoET_2_substitutions.sh${DMS_output_score_folder_subs}PoET-2scripts/scoring_DMS_zero_shot/scoring_PoET_2_indels.sh${DMS_output_score_folder_indels}PoET-2scripts/scoring_clinical_zero_shot/scoring_PoET_2_substitutions.sh${clinical_output_score_folder_subs}PoET-2scripts/scoring_clinical_zero_shot/scoring_PoET_2_indels.sh${clinical_output_score_folder_indels}PoET-2These scripts will automatically download predicted protein structures from AlphaFoldDB, which can take a significant amount of time. The structures are saved to the following cache directories, requiring approximately ~12 GB for the DMS benchmark and ~310 GB for the Clinical benchmark:
${PROTEINGYM_CACHE}/baselines/PoET-2/DMS_AF2_structures_cache${PROTEINGYM_CACHE}/baselines/PoET-2/clinical_AF2_structures_cacheTo download the structures without running model inference, you can first run the scripts with the
SAMPLE_PROMPTS_ONLY=1environment variable.By default, all scoring scripts will attempt to utilize every GPU available to them.
💡 A Note on GPU Usage
While the scripts default to using all available GPUs, the nature of the two benchmarks lends itself to different parallelization strategies. You may want to consider the following for the most efficient inference:
📊 Performance
Using the provided evaluation scripts, we obtained the following performance:
0.5000.5730.9320.9490.945* The set of indels on the public leaderboard is a subset of all available variants in the benchmark's downloadable data.
🔗 Precomputed Scores
Precomputed PoET-2 predictions can be downloaded from the following links. In each file, the score is located in the final column, named
PoET-2.Update on 2026/01/26
Outputs from
performance_DMS_benchmarks.pyfor DMS benchmarks (AFDB v6) are also available here.