Back

Does the sequence of a disordered protein encode small molecule binding paths?

Louet, A. A. B.; Hummer, G.; Vendruscolo, M.

2026-05-23 biophysics
10.64898/2026.05.20.726646 bioRxiv
Show abstract

Ligand binding to intrinsically disordered proteins resists description in terms of conventional binding pockets, yet it can be analysed as a dynamic process in which ligands move across transient surface interaction sites. Here we characterise a pathway-based representation in which ligand binding is described as a sequence of transitions between residue-defined microstates, enabling ligand-specific effects to be distinguished from intrinsic properties of the peptide conformational ensemble. Using all-atom molecular dynamics simulations of A{beta}42 and the C-terminal region of -synuclein in complex with chemically diverse small molecules, we construct transition matrices that encode ligand movement across the peptide surface and use Markov state models to identify dominant binding pathways and relative binding propensities. Pairwise enrichment-factor and AUC analyses reveal strong conservation of the highest-ranked pathways across chemically diverse ligands, with enrichment factors of 15-45 for the top-ranked states and AUC values typically [≥]0.75, well above random expectation. These dominant pathways are also preserved across changes in pH and temperature, whereas a urea control, included as a non-specific binder, shows reduced enrichment, indicating that ligands primarily modulate pathway weights rather than define the underlying network topology. Ensemble docking across chemically diverse libraries further supports the presence of recurrent ligand-accessible hotspots within the peptide conformational ensemble. Building on this framework, we apply a prospective screening pipeline to A{beta}42, combining MSM-derived hotspots with sequence-based Ligand-Transformer scoring and Gnina docking across 1.66 million compounds, to nominate 19 candidates for prospective experimental evaluation. Together, these results indicate that disordered protein sequences give rise to conformational ensembles that exhibit characteristic binding pathways for small molecules.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.1%
28.2%
2
Chemical Science
71 papers in training set
Top 0.1%
18.9%
3
Journal of Chemical Theory and Computation
126 papers in training set
Top 0.2%
4.9%
50% of probability mass above
4
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 16%
4.4%
5
Journal of the American Chemical Society
199 papers in training set
Top 2%
4.0%
6
Protein Science
221 papers in training set
Top 0.4%
3.7%
7
The Journal of Physical Chemistry B
158 papers in training set
Top 0.5%
3.7%
8
Biophysical Journal
545 papers in training set
Top 2%
2.1%
9
PLOS Computational Biology
1633 papers in training set
Top 14%
1.9%
10
The Journal of Physical Chemistry Letters
58 papers in training set
Top 0.7%
1.9%
11
Journal of Molecular Biology
217 papers in training set
Top 1%
1.9%
12
Scientific Reports
3102 papers in training set
Top 56%
1.7%
13
Computational and Structural Biotechnology Journal
216 papers in training set
Top 4%
1.7%
14
Structure
175 papers in training set
Top 2%
1.5%
15
Nature Communications
4913 papers in training set
Top 54%
1.5%
16
Frontiers in Molecular Biosciences
100 papers in training set
Top 3%
1.2%
17
eLife
5422 papers in training set
Top 49%
1.2%
18
PLOS ONE
4510 papers in training set
Top 62%
1.0%
19
Angewandte Chemie International Edition
81 papers in training set
Top 3%
0.9%
20
Communications Chemistry
39 papers in training set
Top 1%
0.8%
21
JACS Au
35 papers in training set
Top 1%
0.7%
22
Advanced Science
249 papers in training set
Top 22%
0.7%
23
Nucleic Acids Research
1128 papers in training set
Top 21%
0.5%
24
Journal of Medicinal Chemistry
68 papers in training set
Top 2%
0.5%