Summary
BERT-based inverse folding models generate overly repetitive sequences (1). This can be avoided by retraining the models with custom losses that look at overall sequence composition.
Details
The loss function used to improve repetitive sequences is as follows:
Figures
Ref (1)
1.
Kim N, Kim M, Ahn S, Park J. Decoupled Sequence and Structure Generation for Realistic Antibody Design. Transactions on Machine Learning Research. 2025; Available from: https://openreview.net/forum?id=CTkABQvnkm