Back

Predicting Degradation Potential of Protein Targeting Chimeras

Petrou, A.; Minhas, F.

2024-09-19 bioinformatics
10.1101/2024.09.16.613208 bioRxiv
Show abstract

PRoteolysis TArgeting Chimeras (PROTACs) can inhibit protein activity by utilizing natural proteasomal degradation pathways for the degradation of target proteins. Being able to determine the degradation potential of PROTACs is crucial in drug development as it can lead to time, labor and cost savings. In this paper, we present a novel machine-learning pipeline that utilizes common compound fingerprints and a pre-trained graph neural network for the prediction of half-maximal degradation concentration of PROTACs by benchmarking a variety of protein tertiary structures and chemical features. Based on critical analysis of our cross-validation and independent test results, we have highlighted several key challenges underlying this prediction problem that need to be addressed to improve the generalization of predictive models in this domain. Moreover, we demonstrate the effectiveness of our approach by testing it on two different datasets and show that it performs better than the current state of the art with an AUC-ROC of 0.85 and accuracy of 0.875 on the DeepPROTACs test dataset.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.2%
22.2%
2
Journal of Cheminformatics
25 papers in training set
Top 0.1%
14.5%
3
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.3%
8.1%
4
PLOS ONE
4510 papers in training set
Top 29%
6.2%
50% of probability mass above
5
Molecules
37 papers in training set
Top 0.1%
4.3%
6
Briefings in Bioinformatics
326 papers in training set
Top 2%
3.9%
7
Bioinformatics
1061 papers in training set
Top 5%
3.5%
8
PLOS Computational Biology
1633 papers in training set
Top 11%
3.0%
9
Scientific Reports
3102 papers in training set
Top 42%
3.0%
10
Artificial Intelligence in the Life Sciences
11 papers in training set
Top 0.1%
2.8%
11
Bioinformatics Advances
184 papers in training set
Top 2%
1.9%
12
Computers in Biology and Medicine
120 papers in training set
Top 3%
1.5%
13
Communications Chemistry
39 papers in training set
Top 0.4%
1.3%
14
International Journal of Molecular Sciences
453 papers in training set
Top 10%
1.3%
15
Pharmaceuticals
33 papers in training set
Top 1%
1.1%
16
BMC Bioinformatics
383 papers in training set
Top 6%
0.9%
17
ACS Omega
90 papers in training set
Top 4%
0.7%
18
Frontiers in Molecular Biosciences
100 papers in training set
Top 6%
0.7%
19
Advanced Science
249 papers in training set
Top 22%
0.6%
20
IEEE Transactions on Computational Biology and Bioinformatics
17 papers in training set
Top 0.8%
0.6%
21
Metabolites
50 papers in training set
Top 2%
0.6%
22
Frontiers in Pharmacology
100 papers in training set
Top 6%
0.6%
23
IEEE/ACM Transactions on Computational Biology and Bioinformatics
32 papers in training set
Top 0.7%
0.6%
24
Patterns
70 papers in training set
Top 3%
0.6%
25
Journal of Medicinal Chemistry
68 papers in training set
Top 1%
0.6%
26
Biochemistry and Biophysics Reports
28 papers in training set
Top 2%
0.6%