Summary

Predicting the structure of antibodies using embeddings from antibody language models leads to equal or worse performance compared to using embeddings from generic protein language models (1,2). The former found that training IgFold with with ESM2-35M embeddings gave comparable performance to using the AntiBERTy embeddings used by default, while the latter obtained better performance on ABodyBuilder3 ProtT5 embeddings compared to IgBERT and IgT5.

Figures

MethodEmbedderEmbedder StatusMeta-archlDDT-CαOCDH FrH1H2H3L FrL1L2L3
IgFold(paper)3.770.450.800.752.990.450.830.511.07
IgFold(reproduced)Antiberty(25M)FreezeIGFold0.933.740.570.920.803.090.671.120.551.15
IgFold-variant1ESM-2(35M)FreezeIGFold0.923.760.620.870.943.060.490.900.511.15
IgFold-variant2ESM-2(650M)FreezeIGFold0.933.770.480.910.943.200.480.940.491.13
IgFold-variant3ESM-2(35M)TrainableIGFold0.933.880.510.890.853.140.511.000.501.10

Ref (1)

1.
Lee J, Han K, Kim J, Yu H, Lee Y. Solvent: A Framework for Protein Folding. 2023; Available from: https://arxiv.org/abs/2307.04603
2.
Kenlay H, Dreyer FA, Cutting D, Nissley D, Deane CM. ABodyBuilder3: improved and scalable antibody structure predictions. Bioinformatics. 2024;40(10). Available from: https://doi.org/10.1093/bioinformatics/btae576