Back

Enhancing Challenging Target Screening via Multimodal Protein-Ligand Contrastive Learning

Wang, Z.; Wang, Z.; Yang, M.; Pang, L.; Nie, F.; Liu, S.; Gao, Z.; Zhao, G.; Ji, X.; Huang, D.; Zhu, Z.; Li, D.; Yuan, Y.; Zheng, H.; Zhang, L.; Ke, G.; Wang, D.; Yu, F.

2024-10-24 bioinformatics
10.1101/2024.08.22.609123 bioRxiv
Show abstract

Recent advancements in genomics and proteomics have identified numerous clinically significant protein targets, with notably 85% categorized as undruggable. These targets present widespread challenges due to their complex structures and dynamics, rendering conventional drug design strategies not always effective. In this study, we introduce Uni-Clip, a contrastive learning framework that incorporates multi-modal features of proteins (structure and residue) and ligands (conformation and graph). Optimized with a specifically designed CF-InfoNCE loss, Uni-Clip enhances the modeling of protein-ligand interactions for both undruggable and druggable proteins. Uni-Clip demonstrates superior performance in benchmark evaluations on widely acknowledged datasets, LIT-PCBA and DUD-E, achieving a 147% and 218% improvements in enrichment factors at 1% compared to baselines. Furthermore, Uni-Clip proves to be a practical tool for various drug discovery applications. In virtual screening for the challenging protein target GPX4 with flat surface, it identified non-covalent inhibitors with an IC50 of 4.17 M, in contrast to the predominantly covalent inhibitors currently known. Through target fishing for benzbromarone, Uni-Clip identified the intrinsically disordered protein c-Myc as a potential target, highlighting benzbromarones potential for repurposing in cancer therapy. Explainable analyses effectively identified binding sites consistent with molecular dynamics and experimental results, even for challenging undruggable targets.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.2%
22.8%
2
Advanced Science
249 papers in training set
Top 0.3%
18.9%
3
Briefings in Bioinformatics
326 papers in training set
Top 0.8%
6.4%
4
Communications Chemistry
39 papers in training set
Top 0.1%
4.9%
50% of probability mass above
5
Computational and Structural Biotechnology Journal
216 papers in training set
Top 1%
4.0%
6
Nature Communications
4913 papers in training set
Top 39%
3.6%
7
Nature Machine Intelligence
61 papers in training set
Top 1%
3.1%
8
Molecular & Cellular Proteomics
158 papers in training set
Top 0.8%
2.6%
9
Patterns
70 papers in training set
Top 0.8%
1.7%
10
Bioinformatics
1061 papers in training set
Top 8%
1.5%
11
Journal of Proteome Research
215 papers in training set
Top 1%
1.5%
12
PLOS Computational Biology
1633 papers in training set
Top 18%
1.3%
13
Nucleic Acids Research
1128 papers in training set
Top 13%
1.3%
14
Chemical Science
71 papers in training set
Top 1%
1.2%
15
Artificial Intelligence in the Life Sciences
11 papers in training set
Top 0.1%
1.2%
16
Communications Biology
886 papers in training set
Top 17%
1.0%
17
Scientific Reports
3102 papers in training set
Top 71%
0.9%
18
Frontiers in Molecular Biosciences
100 papers in training set
Top 4%
0.9%
19
Journal of Cheminformatics
25 papers in training set
Top 0.5%
0.9%
20
International Journal of Molecular Sciences
453 papers in training set
Top 13%
0.9%
21
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 6%
0.8%
22
Journal of Medicinal Chemistry
68 papers in training set
Top 1%
0.8%
23
PLOS ONE
4510 papers in training set
Top 66%
0.8%
24
Molecules
37 papers in training set
Top 2%
0.8%
25
npj Systems Biology and Applications
99 papers in training set
Top 2%
0.8%
26
The Journal of Physical Chemistry Letters
58 papers in training set
Top 2%
0.8%
27
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 45%
0.7%
28
JACS Au
35 papers in training set
Top 1%
0.7%
29
Computers in Biology and Medicine
120 papers in training set
Top 5%
0.7%
30
iScience
1063 papers in training set
Top 40%
0.5%