Summary

Deep learning models such as DiffDock are SOTA at finding pockets but are outperformed by traditional methods at getting the ligand pose right (1). These results are in ground truth pockets and do not look at ESMFold pockets. This is suspected to be due to their lack of inductive biases for identifying interactions that drive high affinity (2).

Figures

Ref (2)

MethodTop-1 RMSD(Å) % < 1Å (↑)Top-1 RMSD(Å) % < 2Å (↑)Top-1 RMSD(Å) Med. (↓)Top-5 RMSD(Å) % < 1Å (↑)Top-5 RMSD(Å) % < 2Å (↑)Top-5 RMSD(Å) Med. (↓)
Deep LearningEquiBind-5.5 ± 1.26.2 ± 0.3---
TANKBind20.4 ± 2.14.0 ± 0.224.5 ± 2.13.4 ± 0.1
TANKBind*2.66 ± 0.2618.18 ± 0.64.2 ± 0.054.13 ± 0.020.39 ± 0.453.5 ± 0.04
DiffDock38.2 ± 2.53.30 ± 0.344.7 ± 2.62.40 ± 0.2
DiffDock*15.41 ± 0.4936.62 ± 0.353.31 ± 0.0321.58 ± 0.3844.19 ± 0.492.37 ± 0.06
TraditionalFpocket + Uni-dock13.33 ± 0.418.7 ± 0.1313.2 ± 0.2619.16 ± 0.3927.32 ± 0.698.3 ± 0.25
P2Rank + Uni-dock19.31 ± 1.0728.6 ± 1.176.4 ± 0.2227.76 ± 1.0339.18 ± 1.033.76 ± 0.06
PointSite + Uni-dock21.36 ± 1.6532.12 ± 0.935.54 ± 0.4631.38 ± 0.8646.06 ± 0.692.52 ± 0.18
Better Pocket + TraditionalDiffDock* + Uni-dock25.49 ± 0.638.93 ± 0.234.14 ± 0.0736.97 ± 1.0551.07 ± 1.061.93 ± 0.12
GT pocket + Uni-dock32.77 ± 0.3851.11 ± 0.61.89 ± 0.0447.5 ± 0.2367.59 ± 0.941.11 ± 0.02
Ref (1)

See also

1.
Yu Y, Lu S, Gao Z, Zheng H, Ke G. Do Deep Learning Models Really Outperform Traditional Approaches in Molecular Docking? 2023; Available from: https://arxiv.org/abs/2302.07134
2.
Errington D, Schneider C, Bouysset C, Dreyer FA. Assessing interaction recovery of predicted protein-ligand poses. Journal of Cheminformatics. 2025;17(1). Available from: https://doi.org/10.1186/s13321-025-01011-6