Back

INSIGHT: In Silico Drug Screening Platform using Interpretable Deep Learning Network

Shi, X.; Ramathal, C.; Dezso, Z.

2025-04-24 bioinformatics
10.1101/2025.04.21.649855 bioRxiv
Show abstract

The large-scale multiplexed drug screening platforms like PRISM and GDSC facilitate the screening of drug treatments over 1,000 cancer cell lines. The cancer cell lines are well characterized by multiomics screening in CCLE and DepMap, enabling the application of AI and machine learning techniques to study the association between drug sensitivity and the underlying molecular profiles. The large scale and variety of data modalities enabled us to build an interpretable deep learning framework, INSIGHT, integrating the multiomics data and the drugs molecular structure to predict drug response. We trained our model on the PRISM screen for single treatments and on the DrugComb screen database for combination treatments. Our method enables the in-silico extension of current screens by predicting drug response in cancer cell lines not included in the screen, as well as the drug response to novel single agent or combination therapy by leveraging the drugs molecular structure. Furthermore, the deep learning framework was built to enable biological interpretation. The connections between the hidden layers of the neural network incorporate prior biological knowledge such as signaling pathways. This enables the model, in addition to predicting the drug sensitivity profiles, to prioritize the pathways predictive of drug response and identify pathways related to the mechanism of action (MOA) and potential off target effects of novel drugs. The evaluation of our model using cross-validation on the PRISM and DrugComb dataset showed an improved performance compared to previously developed biologically informed deep learning methods and traditional state of the art machine learning methods like elastic-net and XGBoost. We illustrated with examples the value of incorporating biological knowledge into INSIGHT by relating the pathway activity of the predictive models to the MOA.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 2%
14.3%
2
npj Systems Biology and Applications
99 papers in training set
Top 0.2%
6.3%
3
Artificial Intelligence in the Life Sciences
11 papers in training set
Top 0.1%
6.3%
4
Scientific Reports
3102 papers in training set
Top 19%
6.3%
5
Patterns
70 papers in training set
Top 0.1%
6.3%
6
Journal of Chemical Information and Modeling
207 papers in training set
Top 1%
4.8%
7
Briefings in Bioinformatics
326 papers in training set
Top 1%
4.3%
8
Nature Machine Intelligence
61 papers in training set
Top 0.8%
3.9%
50% of probability mass above
9
Computational and Structural Biotechnology Journal
216 papers in training set
Top 2%
3.6%
10
iScience
1063 papers in training set
Top 5%
3.6%
11
PLOS Computational Biology
1633 papers in training set
Top 10%
3.6%
12
Nature Communications
4913 papers in training set
Top 42%
3.2%
13
Journal of Cheminformatics
25 papers in training set
Top 0.2%
2.1%
14
Bioinformatics Advances
184 papers in training set
Top 3%
1.7%
15
Genome Medicine
154 papers in training set
Top 5%
1.3%
16
PLOS ONE
4510 papers in training set
Top 59%
1.3%
17
BMC Bioinformatics
383 papers in training set
Top 5%
1.2%
18
Communications Biology
886 papers in training set
Top 17%
0.9%
19
Frontiers in Molecular Biosciences
100 papers in training set
Top 3%
0.9%
20
Frontiers in Genetics
197 papers in training set
Top 8%
0.9%
21
npj Digital Medicine
97 papers in training set
Top 3%
0.9%
22
GigaScience
172 papers in training set
Top 3%
0.8%
23
Advanced Science
249 papers in training set
Top 18%
0.8%
24
Computers in Biology and Medicine
120 papers in training set
Top 4%
0.8%
25
International Journal of Molecular Sciences
453 papers in training set
Top 16%
0.7%
26
Nucleic Acids Research
1128 papers in training set
Top 18%
0.7%
27
Communications Chemistry
39 papers in training set
Top 1%
0.7%
28
Metabolites
50 papers in training set
Top 1%
0.7%
29
BioData Mining
15 papers in training set
Top 1%
0.7%
30
Frontiers in Pharmacology
100 papers in training set
Top 5%
0.6%