Back

Quantum analysis of protein-ligand binding by integrating structural resolution, sequence homology, and ligand properties

Roosan, D.; Samrose, S.; Khan, R.; Nirzhor, S.; Provencher, B.

2025-06-30 molecular biology
10.1101/2025.06.27.661905 bioRxiv
Show abstract

Predicting protein-ligand binding affinity is a fundamental challenge in computational biology and drug discovery, complicated by diverse factors including protein sequence variability, ligand chemical diversity, and structural resolution. Here, we present an integrative study that combines classical machine learning and quantum-enhanced modeling to investigate how crystal structure resolution, sequence similarity, and ligand properties jointly influence binding affinity. Using a curated "refined" dataset from PDBbind and an expanded general dataset, we first conduct correlation and regression analyses to quantify the relationships among binding affinity, ligand descriptors (e.g., molecular weight, logP), and protein structural metrics (resolution, R-factor). We observe moderate positive correlations between ligand size/hydrophobicity and affinity, and a slight negative correlation between resolution and affinity in the refined dataset that largely disappears in the general set. We then train multiple predictive models, including random forests, deep neural networks, and quantum-enhanced approaches--quantum kernel methods, variational quantum circuits, and a hybrid classical-quantum neural network. Experimental results show that quantum-enhanced models perform on par with classical methods in predicting binding affinities and, in some cases, offer modest improvements. Notably, a hybrid quantum-classical model achieves the highest accuracy (Pearson correlation R{approx}0.80R) on the refined dataset. These findings highlight the potential of quantum computing for capturing complex patterns in biomolecular data, laying groundwork for improved structure-based drug design. Our study underscores that while data quality and curation greatly influence observed trends, quantum machine learning despite current hardware limitations can already serve as a competitive and promising tool in computational structural biology. AUTHOR SUMMARYIn our study, we confront a major bottleneck in the creation of new medicines: accurately predicting how strongly a potential drug will attach to its target protein in the body. Getting this right early in the process could save billions of dollars and years of research by preventing dead-ends. We investigated if quantum computing, a new technology that processes information in a fundamentally different way, could provide a better solution. We trained several computer models to predict these binding strengths, comparing standard artificial intelligence (AI) with new models enhanced by quantum machine learning. Our results showed that a hybrid model, which strategically combines classical AI with a quantum-powered component, delivered the most accurate predictions on high-quality, curated data. This work demonstrates that quantum computing is already a competitive tool for real-world biological problems, not just a future possibility. We believe that by further developing these quantum-enhanced approaches, we can create more reliable predictive tools to make the long and difficult search for new drugs faster and more successful.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.1%
26.5%
2
Journal of Computational Chemistry
11 papers in training set
Top 0.1%
6.5%
3
PLOS Computational Biology
1633 papers in training set
Top 5%
6.5%
4
Computational and Structural Biotechnology Journal
216 papers in training set
Top 1%
3.8%
5
Bioinformatics
1061 papers in training set
Top 5%
3.7%
6
Scientific Reports
3102 papers in training set
Top 34%
3.7%
50% of probability mass above
7
Biology Methods and Protocols
53 papers in training set
Top 0.4%
2.8%
8
PeerJ
261 papers in training set
Top 3%
2.8%
9
Journal of Molecular Biology
217 papers in training set
Top 1%
2.1%
10
PLOS ONE
4510 papers in training set
Top 47%
2.1%
11
Journal of Chemical Theory and Computation
126 papers in training set
Top 0.4%
2.1%
12
Biophysical Journal
545 papers in training set
Top 2%
1.9%
13
BMC Bioinformatics
383 papers in training set
Top 4%
1.7%
14
Protein Science
221 papers in training set
Top 0.8%
1.7%
15
Entropy
20 papers in training set
Top 0.1%
1.7%
16
International Journal of Molecular Sciences
453 papers in training set
Top 7%
1.7%
17
Frontiers in Molecular Biosciences
100 papers in training set
Top 2%
1.4%
18
Acta Crystallographica Section D Structural Biology
54 papers in training set
Top 0.3%
1.3%
19
ACS Omega
90 papers in training set
Top 3%
1.0%
20
Genome Biology and Evolution
280 papers in training set
Top 1%
1.0%
21
Communications Biology
886 papers in training set
Top 18%
0.9%
22
Journal of Theoretical Biology
144 papers in training set
Top 1%
0.9%
23
Structure
175 papers in training set
Top 3%
0.9%
24
iScience
1063 papers in training set
Top 26%
0.9%
25
Physical Chemistry Chemical Physics
34 papers in training set
Top 0.6%
0.8%
26
IUCrJ
29 papers in training set
Top 0.3%
0.8%
27
Chemical Science
71 papers in training set
Top 2%
0.8%
28
Artificial Intelligence in the Life Sciences
11 papers in training set
Top 0.3%
0.7%
29
eLife
5422 papers in training set
Top 58%
0.7%
30
Journal of Molecular Evolution
21 papers in training set
Top 0.4%
0.7%