Summary

Antibody-specific protein language models are worse for antibody expression prediction than generic PLMs (1).

Figures

ModelBinding N = 422 (Shanehsazzadeh et al., 2023)Binding N = 2048 (Warszawski et al., 2019)Binding N = 4275 (Koenig et al., 2017)Expression N = 4275 (Koenig et al., 2017)
AbLang (Olsen et al., 2022b)0.293 ± 0.1170.246 ± 0.0380.244 ± 0.0340.439 ± 0.027
AntiBERTy (Ruffolo et al., 2021)0.239 ± 0.1020.217 ± 0.0560.199 ± 0.0250.401 ± 0.032
ProtBert (Elnaggar et al., 2022)0.200 ± 0.1060.149 ± 0.0240.101 ± 0.0170.491 ± 0.029
IgBert-unpaired0.278 ± 0.0940.181 ± 0.0400.177 ± 0.0180.347 ± 0.023
IgBert0.306 ± 0.1140.131 ± 0.0470.174 ± 0.0320.400 ± 0.023
ProtT5 (Elnaggar et al., 2022)0.290 ± 0.1050.186 ± 0.0370.206 ± 0.0290.697 ± 0.02
IgT5-unpaired0.299 ± 0.1190.245 ± 0.0490.179 ± 0.0140.567 ± 0.025
IgT50.274 ± 0.0700.297 ± 0.0570.25 ± 0.0190.548 ± 0.067

Ref (1)

See also

1.
Kenlay H, Dreyer FA, Kovaltsuk A, Miketa D, Pires D, Deane CM. Large scale paired antibody language models. PLOS Computational Biology. 2024;20(12):e1012646. Available from: https://doi.org/10.1371/journal.pcbi.1012646