Back

The selectivity implications of docking libraries with greater and lesser similarities to bio-like molecules

Hall, B. W.; Sakamoto, K.; Huang, X.-P.; Irwin, J. J.; Shoichet, B. K.; Roth, B. L.

2026-02-04 biophysics
10.64898/2026.02.02.703317 bioRxiv
Show abstract

As virtual libraries have expanded into the tens of billions via make-on-demand chemistry, their similarity to metabolites, natural products, and drugs ("bio-like" molecules) has rapidly diminished. Despite this divergence, molecular docking of these ultra-large libraries has yielded molecules at higher experimental hit-rates and with improved affinities. The structural divergence from bio-like space raises the possibility that molecules from these ultra-large libraries have improved selectivity. Just as plausibly, if hit-rates on-target are divorced from similarity to bio-like molecules, so too may be selectivity against off-targets. Here, we test whether docking hits for the 5-HT2A serotonin receptor from ultra-large libraries are more selective than those from smaller and more bio-like "in-stock" libraries. Chemoinformatic similarity predicts that docking actives from the in-stock library have more off-targets than the more chemically novel hits emerging from docking the ultra-large library. This may reflect the bias of the known, however, as when tested experimentally at scale against 318 GPCRs, both 16 agonists from the ultra-large library and 20 actives from the in-stock library had similar numbers of off-targets. While the ultra-large library hits are more sub-type selective for the 5-HT2A over the 5-HT2B and 5-HT2C receptors, overall these results may suggest that selectivity against off-targets, like affinity and hit-rates for on-targets, is divorced from library similarity to bio-like molecules.

Matching journals

The top 2 journals account for 50% of the predicted probability mass.

1
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.1%
41.0%
2
PLOS ONE
4510 papers in training set
Top 14%
13.0%
50% of probability mass above
3
eLife
5422 papers in training set
Top 19%
4.5%
4
Structure
175 papers in training set
Top 0.7%
3.7%
5
Scientific Reports
3102 papers in training set
Top 40%
3.2%
6
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 24%
2.8%
7
Chemical Science
71 papers in training set
Top 0.5%
2.7%
8
Frontiers in Chemistry
14 papers in training set
Top 0.1%
2.2%
9
Cell Chemical Biology
81 papers in training set
Top 2%
1.7%
10
ACS Omega
90 papers in training set
Top 2%
1.4%
11
Journal of Medicinal Chemistry
68 papers in training set
Top 0.8%
1.3%
12
PLOS Computational Biology
1633 papers in training set
Top 19%
1.3%
13
Frontiers in Pharmacology
100 papers in training set
Top 3%
1.3%
14
Molecules
37 papers in training set
Top 1%
0.9%
15
Proteins: Structure, Function, and Bioinformatics
82 papers in training set
Top 0.8%
0.8%
16
Biomolecules
95 papers in training set
Top 2%
0.8%
17
Biochemistry
130 papers in training set
Top 2%
0.8%
18
Computational and Structural Biotechnology Journal
216 papers in training set
Top 9%
0.8%
19
Protein Science
221 papers in training set
Top 2%
0.7%
20
ACS Chemical Biology
150 papers in training set
Top 2%
0.7%
21
ACS Medicinal Chemistry Letters
16 papers in training set
Top 0.6%
0.7%
22
International Journal of Molecular Sciences
453 papers in training set
Top 17%
0.7%
23
RSC Advances
18 papers in training set
Top 2%
0.7%
24
PeerJ
261 papers in training set
Top 17%
0.7%
25
Frontiers in Molecular Biosciences
100 papers in training set
Top 7%
0.5%
26
ACS Pharmacology & Translational Science
40 papers in training set
Top 1%
0.5%