Summary

Protein backbone design models that use diffusion undersample alpha/beta topologies and loop regions, particular those found in functional motifs found in enzymes, and oversample helical domains not found in CATH (1). This includes designable sheet-rich domains like immunoglobulins and is consistent across all diffusion models tested.

Figures

Ref (1)

See also

1.
Lu T, Liu M, Chen Y, Kim J, Huang P-S. Assessing generative model coverage of protein structures with SHAPES. Cell Systems. 2025;16(8):101347. Available from: https://doi.org/10.1016/j.cels.2025.101347