Summary

Antibody language models trained on paired heavy and light chains outperform equivalent models trained only on unpaired data (1). They also outperform generic PLMs.

Figures

ModelFWH1FWH2FWH3FWH4CDRH1CDRH2CDRH3Total VH
AbLang (Olsen et al., 2022b)0.97950.96670.95600.98080.90990.88450.59260.9105
AntiBERTy (Ruffolo et al., 2021)0.97840.96530.95450.97750.90730.88210.52090.8998
ProtBert (Elnaggar et al., 2022)0.80180.76070.73840.84630.65600.45560.27720.6821
IgBert-unpaired0.97910.96550.95520.97980.90430.88410.59240.9099
IgBert0.98100.96900.95760.98090.91300.88650.60120.9129
ProtT5 (Elnaggar et al., 2022)0.90370.85390.88800.91420.75300.62920.33900.7932
IgT5-unpaired0.97900.96710.95600.98250.90920.88390.60350.9121
IgT50.98200.96870.95740.98280.91500.89360.61960.9163
ModelFWL1FWL2FWL3FWL4CDRL1CDRL2CDRL3Total VL
AbLang (Olsen et al., 2022b)0.96630.96830.97070.96210.89110.90080.83850.9493
AntiBERTy (Ruffolo et al., 2021)0.97860.96870.97480.96610.90660.89510.84440.9553
ProtBert (Elnaggar et al., 2022)0.65970.78620.78270.63370.46900.43820.29010.6654
IgBert-unpaired0.98040.97040.97390.96560.90810.89850.84610.9560
IgBert0.98850.97380.98070.97400.92320.91490.86340.9647
ProtT5 (Elnaggar et al., 2022)0.84560.90100.87990.84990.69610.60380.51720.8200
IgT5-unpaired0.98090.96750.97520.91710.90760.90930.84230.9515
IgT50.98780.97350.98150.97840.92220.91630.86930.9656

Ref (1)

1.
Kenlay H, Dreyer FA, Kovaltsuk A, Miketa D, Pires D, Deane CM. Large scale paired antibody language models. PLOS Computational Biology. 2024;20(12):e1012646. Available from: https://doi.org/10.1371/journal.pcbi.1012646