Quartz 4

Home

❯

notes

❯

PLMs downweigh probability of sequences with multiple mutations

Created Jul 02, 2024Modified Apr 21, 2026

variant-effect-prediction

Summary

PLMs are biased against sequences with multiple mutations (1).

Details

The authors propose normalizing these by generating large quantities ( $1 0^{4}$ ) of mutants with equal number of mutations and ranking them that way.

Figures

Ref (1)

1.

Shaw A, Spinner H, Shin J, Gurev S, Rollins N, Marks D. Removing bias in sequence models of protein fitness. openRxiv; 2023. Available from: https://doi.org/10.1101/2023.09.28.560044

Graph View

Summary
Details
Figures

Backlinks

Sequences with lower log-likelihoods are worse for zero-shot variant effect prediction using PLMs
Zero-shot protein stability prediction using inverse folding models can be improved by subtracting predictions from residue in isolation

GitHub
Discord Community