Summary

Antibody-specific protein language models outperform generic PLMs on intrafamily but not general thermostability prediction (1). A version of ProGen specifically trained on antibody sequences outperform generic ProGen models on intra-family thermostability prediction. On inter-family prediction, they are bested by ESM-IF (see Structure-based methods outperform sequence-based methods on protein stability prediction of point mutants, but not full sequences).

Details

One related observation (unpublished as of 19 April 2026) is that the mean-pooled CDRH3 embeddings learned by generic LMs, but not antibody LMs, are basically meaningless insofar as they match those of scrambled CDRH3 sequences with the same framework. A separate theory is that this could be because antibodies are separated by V-gene by antibody LMs.

Figures

Figure 5 from (1)

See also

1.
Chungyoun M, Ruffolo J, Gray J. FLAb: Benchmarking deep learning methods for antibody fitness prediction. openRxiv; 2024. Available from: https://doi.org/10.1101/2024.01.13.575504