Summary

Averaging logits from multiple DL models can lead to substantial improvements in fitness and stability prediction. The method TranceptEVE, which combines probabilities from EVE and Tranception, is the state of the art method for predicting protein fitness and thermostability (as judged by ProteinGym; (1)). Likewise, adding probabilities from ESM-IF further improves this method (2). (3) found that ensembles of ProGen models outperform individual models on a range of tasks.

Figures

Model nameExpressionBindingActivityStabilityOrganismal fitnessMean
N. Assays610316869n/a
EVE (ens.)0.4600.3220.4530.4180.4700.425
GEMME0.3820.3610.4900.5180.4690.444
ProGen2 (ens.)0.4590.2980.4170.4220.4460.408
VESPA0.4830.3780.4720.5010.4680.460
ProteinMPNN0.1620.1480.2130.5550.1770.251
ESM-IF10.4010.3750.3870.6300.3680.432
Tranception L0.4410.3290.4620.4730.4590.433
TranceptEVE0.4810.3410.4820.5020.4780.457
StructSeq0.5490.3990.4980.6330.4680.509

Ref (2)

See also

1.
Notin P, Van Niekerk L, Kollasch AW, Ritter D, Gal Y, Marks DS. TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. openRxiv; 2022. Available from: https://doi.org/10.1101/2022.12.07.519495
2.
Paul S, Kollasch A, Notin P, Marks D. Combining Structure and Sequence for Superior Fitness Prediction. In: GenBio@NeurIPS2023. 2023. Available from: https://openreview.net/forum?id=8PbTU4exnV
3.
Nijkamp E, Ruffolo JA, Weinstein EN, Naik N, Madani A. ProGen2: Exploring the boundaries of protein language models. Cell Systems. 2023;14(11):968-978.e3. Available from: https://doi.org/10.1016/j.cels.2023.10.002