Summary

Fine-tuning Protein language models using rank information is more effective than fine-tuning using absolute property measurements (regression; (1)). It also converts a dataset of sequences to pairwise comparisons, while generalizing better to unseen positions.

Figures

Model NameScoring FunctionLosssingle n=32single n=128single n=512multi n=32multi n=128multi n=512
ESM-1v (650M)linear headmse0.2630.4150.5350.4940.6370.771
wt-marginalsranking0.4790.5520.6410.5770.6420.753
ESM-2 (650M)linear headmse0.2800.3980.5350.4270.5960.743
wt-marginalsranking0.4550.5300.6270.5930.6510.758
PoETlinear headmse0.4430.5530.6460.5710.7140.793
likelihoodranking0.5130.5910.6720.6670.7370.806
ProteinNPT (MSAT)mse0.4150.5330.6370.5170.6920.791
ProteinNPT (ESM-1v)mse0.4100.4970.6070.4380.6450.769
Emb. aug. (MSAT)mse0.4240.5070.5530.5810.6960.764
Emb. aug. (ESM-1v)mse0.4510.5050.5500.4400.6240.702
OHE aug. (MSAT)mse0.4290.4670.4960.6160.6840.763
OHE aug. (ESM-1v)mse0.4660.5020.5260.4600.5660.711
OHEmse0.1440.3140.4880.2680.4730.664
ModelScoring fn.LossSeenUnseen
ESM-1v (650M)linear headmse0.4600.331
wt-marginalsrank0.5920.509
ESM-2 (650M)linear headmse0.4530.297
wt-marginalsrank0.5680.455
PoETlinear headmse0.5710.517
likelihoodrank0.6120.549
PNPT (MSAT)-mse0.5630.462
PNPT (ESM-1v)-mse0.5290.420

Tables from (1)

See also

1.
Hawkins-Hooker A, Surana S, Simons J, Kmec J, Bent O, Duckworth P. Likelihood-based Fine-tuning of Protein Language Models for Few-shot Fitness Prediction and Design. openRxiv; 2024. Available from: https://doi.org/10.1101/2024.05.28.596156