Hybrid sequence-structure models combine protein language models with inverse folding models, drawing on both massive sequence databases and highly detailed structural data.

Methods

  • GearNet-ESM
  • MIF-ST
  • ProstT5 (1)
  • SaProt
  • InstructPLM
  • CarbonDesign (2)

1. Heinzinger M, Weissenow K, Sanchez JG, Henkel A, Mirdita M, Steinegger M, et al. Bilingual language model for protein sequence and structure. NAR Genomics and Bioinformatics. 2024;6(4). Available from: https://doi.org/10.1093/nargab/lqae150
2. Ren M, Yu C, Bu D, Zhang H. Accurate and robust protein sequence design with CarbonDesign. Nature Machine Intelligence. 2024;6(5):536–47. Available from: https://doi.org/10.1038/s42256-024-00838-2