Back

A Multi-Modal AI/ML-based Framework for Protein Conformation Selection and Prediction in Drug Discovery Applications

Gupta, S.; Menon, V.; Baudry, J.

2026-02-18 molecular biology
10.64898/2026.02.17.706293 bioRxiv
Show abstract

The development of pharmaceutical drugs is a time-intensive and costly process, with more than 90% of drug candidates failing during preclinical or clinical testing. A major challenge lies in accurately predicting protein-ligand interactions, especially given that traditional computational methods often rely on a single protein conformation, failing to capture biologically relevant structural variability. To address this, we present an AI/ML-based multi-modal framework based on Graph Convolutional Network (GCN) that integrates both global and local protein descriptors to classify binding and non-binding conformations more effectively. Global descriptors capture overarching physico-chemical and structural properties of proteins, while local descriptors--such as pharmacophores--provide site-specific information crucial for modeling ligand interactions. Our GCN based approach demonstrates that integrating local and global structural perspectives significantly improves predictive accuracy and robustness. By enabling more reliable protein conformation classification, this work contributes toward scalable, AI-driven drug discovery--an increasingly critical goal in response to global health challenges.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.2%
23.0%
2
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.5%
6.5%
3
PLOS ONE
4510 papers in training set
Top 26%
6.5%
4
Scientific Reports
3102 papers in training set
Top 17%
6.5%
5
Journal of Cheminformatics
25 papers in training set
Top 0.1%
4.1%
6
Briefings in Bioinformatics
326 papers in training set
Top 2%
4.1%
50% of probability mass above
7
Nature Machine Intelligence
61 papers in training set
Top 0.8%
3.7%
8
Computers in Biology and Medicine
120 papers in training set
Top 0.8%
3.7%
9
Advanced Science
249 papers in training set
Top 7%
2.7%
10
Communications Biology
886 papers in training set
Top 3%
2.7%
11
BMC Bioinformatics
383 papers in training set
Top 4%
1.7%
12
Bioinformatics
1061 papers in training set
Top 7%
1.7%
13
International Journal of Molecular Sciences
453 papers in training set
Top 7%
1.7%
14
PLOS Computational Biology
1633 papers in training set
Top 16%
1.7%
15
Nature Communications
4913 papers in training set
Top 50%
1.7%
16
iScience
1063 papers in training set
Top 15%
1.7%
17
Nucleic Acids Research
1128 papers in training set
Top 11%
1.7%
18
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 36%
1.4%
19
Biology Methods and Protocols
53 papers in training set
Top 1%
1.3%
20
Frontiers in Molecular Biosciences
100 papers in training set
Top 3%
1.0%
21
ACS Omega
90 papers in training set
Top 4%
0.8%
22
IEEE Transactions on Computational Biology and Bioinformatics
17 papers in training set
Top 0.7%
0.7%
23
Science Bulletin
22 papers in training set
Top 0.9%
0.7%
24
Genome Medicine
154 papers in training set
Top 9%
0.7%
25
Molecules
37 papers in training set
Top 2%
0.7%
26
Communications Chemistry
39 papers in training set
Top 2%
0.5%
27
eLife
5422 papers in training set
Top 63%
0.5%
28
Biomolecules
95 papers in training set
Top 3%
0.5%
29
Journal of Molecular Biology
217 papers in training set
Top 5%
0.5%
30
Entropy
20 papers in training set
Top 0.6%
0.5%