Sequence-based DL models learn intra-family phylogenetics of protein evolution

Summary

Deep learning (DL) models for protein sequences, including protein language models (PLMs), learn the phylogenetics of protein evolution. (1) showed this in the 2D latent space of protein family-specific variational autoencoders (figures below), whereas (2) showed that repurposing perplexity as velocities within a manifold of the ESM-1b sequence embedding space recapitulates the evolutionary trajectories of viral and eukaryotic proteins.
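To make the "perplexity as velocity" idea concrete, here is a minimal sketch. It is not the procedure of (2): it assumes each sequence already has an embedding and a single perplexity score, and it uses a simplified weighting (difference in log-perplexity between a sequence and its nearest neighbours). The function names `pseudo_perplexity` and `velocity_field`, the parameter `k`, and the toy data are all hypothetical.

```python
import numpy as np


def pseudo_perplexity(token_log_probs):
    """Perplexity as exp of the mean negative log-probability over positions."""
    return float(np.exp(-np.mean(token_log_probs)))


def velocity_field(embeddings, perplexities, k=2):
    """Assign each sequence a velocity vector in embedding space.

    Assumption (illustrative only): the velocity of sequence i is the mean
    displacement toward its k nearest neighbours, each weighted by
    log(ppl_i) - log(ppl_j). A neighbour with lower perplexity attracts,
    a neighbour with higher perplexity repels. Requires k <= n - 1.
    """
    X = np.asarray(embeddings, dtype=float)
    ppl = np.asarray(perplexities, dtype=float)
    n = X.shape[0]

    # Pairwise Euclidean distances; exclude self-matches.
    dists = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    np.fill_diagonal(dists, np.inf)

    V = np.zeros_like(X)
    for i in range(n):
        nbrs = np.argsort(dists[i])[:k]
        weights = np.log(ppl[i]) - np.log(ppl[nbrs])
        V[i] = (weights[:, None] * (X[nbrs] - X[i])).sum(axis=0) / k
    return V


# Toy data: three sequences on a line, with perplexity decreasing left to right.
toy_embeddings = [[0.0, 0.0], [1.0, 0.0], [2.5, 0.0]]
toy_perplexities = [3.0, 2.0, 1.0]
toy_velocities = velocity_field(toy_embeddings, toy_perplexities, k=1)
```

In this toy setting every velocity vector points along the positive x-axis, toward the lower-perplexity end of the line. In a real analysis the embeddings and scores would come from a PLM such as ESM-1b, and the resulting vector field would be compared against known phylogeny.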

Figures

Figures from (1)

Ref (2)
