SaProt is a protein language model trained on both amino acid identities and Foldseek structural tokens derived from AlphaFold2-predicted structures. As of mid-May 2024, it outperforms all other methods on variant effect prediction.
Details
- Structural tokens for residues with pLDDT values less than 70 are masked.
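
The masking rule above can be sketched as follows. This is a minimal illustration, not SaProt's actual preprocessing code: the function name `build_structure_aware_sequence`, the `#` mask character, and the convention of interleaving an uppercase amino acid with a lowercase Foldseek token are assumptions made for the example.

```python
from typing import Sequence

MASK_TOKEN = "#"          # assumed placeholder for a masked structural token
PLDDT_THRESHOLD = 70.0    # residues with pLDDT below this lose their structural token


def build_structure_aware_sequence(
    amino_acids: str,
    foldseek_tokens: str,
    plddt: Sequence[float],
    threshold: float = PLDDT_THRESHOLD,
) -> str:
    """Interleave amino acids with structural tokens, masking low-confidence ones.

    Hypothetical helper: each residue becomes a two-character unit made of the
    uppercase amino acid and either its lowercase Foldseek token or MASK_TOKEN.
    """
    if not (len(amino_acids) == len(foldseek_tokens) == len(plddt)):
        raise ValueError("amino_acids, foldseek_tokens and plddt must have equal length")

    units = []
    for aa, token, score in zip(amino_acids, foldseek_tokens, plddt):
        # Keep the structural token only when the prediction is confident enough;
        # a score of exactly `threshold` is kept, since only values below it are masked.
        structural = token.lower() if score >= threshold else MASK_TOKEN
        units.append(aa.upper() + structural)
    return "".join(units)


if __name__ == "__main__":
    # The second residue has pLDDT 65.2 (< 70), so its structural token is masked.
    print(build_structure_aware_sequence("MKV", "dpv", [90.0, 65.2, 70.0]))  # MdK#Vv
```

Only the structural half of a residue is masked in this sketch; the amino acid identity is kept, so the model still sees the sequence in low-confidence regions.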