Correlation between sequence log-likelihood and variant effect prediction performance breaks down as PLMs get larger

Summary

The correlation between a sequence’s log-likelihood and the performance of variant effect prediction using protein language models breaks down as the PLMs get larger (1). This happens specifically due to poorer performance with high-log-likelihood sequences.

Figures

Ref (1)

Quartz 4

Explorer

Correlation between sequence log-likelihood and variant effect prediction performance breaks down as PLMs get larger

Summary

Figures

See also

Graph View

Backlinks