Back

DESPOT: Direction-Enhanced Scoring POTentials

Poelmans, R.; Bruncsics, B.; Arany, A.; Van Eynde, W.; Shemy, A.; Moreau, Y.; Voet, A. R.

2026-04-02 bioinformatics
10.64898/2026.03.31.714140 bioRxiv
Show abstract

Knowledge-based potentials (KBPs) have long been used to score protein-ligand interactions, yet existing formulations remain isotropic, capturing only distance dependencies and neglecting the directional preferences that govern molecular recognition. Here, we introduce Direction-Enhanced Scoring POTentials (DESPOT), an anisotropic knowledge-based framework that unifies pose scoring and binding-site characterisation within a single probabilistic model. The new probabilistic formulation used in DESPOT naturally supports directional modelling through atom type-specific local reference frames and symmetry-aware geometric discretisation. It also supports steric exclusion, encoded as a dedicated void state that explicitly captures the probability that a spatial bin remains unoccupied. The anisotropic interaction profiles learned by DESPOT reveal systematic directional preferences for interactions such as hydrogen bonds, aromatic interactions, and halogen bonds, that extend beyond idealised geometric models. Evaluation on the CASF-2016 benchmark shows that DESPOT sub-stantially outperforms isotropic KBPs in all pose-discrimination and virtual screening tasks (p << 0.0001 for all enrichment factors), with the largest gains arising from its ability to penalise geometrically implausible poses. Constrained energy minimisation of training structures proves strongly beneficial for the derivation of KBPs, while our train-test leakage analysis reveals that overfitting is an underestimated and understudied issue for KBPs. DESPOT provides a data-driven framework for direction-aware modelling of protein-ligand interactions, with applications in pose scoring, binding-site characterisation, and structure-based design.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.1%
38.8%
2
Nature Communications
4913 papers in training set
Top 21%
8.6%
3
Bioinformatics
1061 papers in training set
Top 3%
8.6%
50% of probability mass above
4
Advanced Science
249 papers in training set
Top 4%
4.1%
5
Computational and Structural Biotechnology Journal
216 papers in training set
Top 2%
2.8%
6
Journal of Cheminformatics
25 papers in training set
Top 0.2%
2.5%
7
Nucleic Acids Research
1128 papers in training set
Top 8%
2.1%
8
Briefings in Bioinformatics
326 papers in training set
Top 3%
2.1%
9
Scientific Reports
3102 papers in training set
Top 49%
2.1%
10
Cell Systems
167 papers in training set
Top 6%
1.9%
11
PLOS Computational Biology
1633 papers in training set
Top 14%
1.9%
12
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 31%
1.7%
13
Chemical Science
71 papers in training set
Top 0.9%
1.7%
14
Nature Methods
336 papers in training set
Top 4%
1.5%
15
PLOS ONE
4510 papers in training set
Top 56%
1.5%
16
Journal of Chemical Theory and Computation
126 papers in training set
Top 0.6%
1.3%
17
The Journal of Physical Chemistry Letters
58 papers in training set
Top 1%
1.1%
18
Bioinformatics Advances
184 papers in training set
Top 4%
1.0%
19
The Journal of Physical Chemistry B
158 papers in training set
Top 2%
1.0%
20
Communications Biology
886 papers in training set
Top 18%
0.9%
21
Communications Chemistry
39 papers in training set
Top 0.9%
0.8%
22
Nature Biotechnology
147 papers in training set
Top 7%
0.7%