Back

Condition-matched in silico prediction of drug transcriptional responses enables mechanism-guided screening and combination discovery

Xiao, M.; He, Y.; Hu, J.; Zou, F.; Zou, B.

2026-03-31 bioinformatics
10.64898/2026.03.27.714886 bioRxiv
Show abstract

Perturbational transcriptomics links therapeutic compounds to cellular mechanisms and provides a powerful framework for drug discovery, but experimentally profiling transcriptional responses across diverse cell states, doses and durations is costly and often infeasible. Here we present DEPICT (Drug rEsponse Prediction in transCriptomics with Transformers), a deep learning framework that predicts condition-matched drug-induced transcriptional responses from baseline gene expression, perturbation settings and complementary drug representations. Using the LINCS L1000 dataset, DEPICT generalized to unseen drugs and cell types and outperformed five baseline strategies and two recent deep learning models. In the most challenging unseen-cell evaluation, DEPICT was the only model to surpass all baselines, improving differential-expression prediction accuracy and reducing perturbed-expression prediction error by 30.3% and 36.8%, respectively, relative to the next-best deep model. In a non-small cell lung cancer (NSCLC) case study, DEPICT-enabled virtual screening prioritized compounds predicted to reverse disease-associated transcriptional signatures. Notably, 13 of the top 20 prioritized compounds had either previously entered NSCLC-related clinical trials or been validated in NSCLC studies, supporting the translational relevance of the predicted perturbational profiles. DEPICT further enabled condition-matched drug synergy prediction and mechanistic exploration when experimentally matched profiles were unavailable. Together, these results show that accurate, condition-matched in silico perturbation profiling can scale transcriptomics-driven hypothesis generation for drug repurposing and combination discovery.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 8%
17.7%
2
Cell Systems
167 papers in training set
Top 0.9%
10.5%
3
Nature Machine Intelligence
61 papers in training set
Top 0.2%
8.5%
4
Genome Medicine
154 papers in training set
Top 1%
4.9%
5
Advanced Science
249 papers in training set
Top 4%
4.3%
6
Nature Methods
336 papers in training set
Top 2%
4.0%
7
Nature Biomedical Engineering
42 papers in training set
Top 0.2%
3.7%
50% of probability mass above
8
Briefings in Bioinformatics
326 papers in training set
Top 2%
3.1%
9
Cell Reports Medicine
140 papers in training set
Top 2%
2.5%
10
PLOS Computational Biology
1633 papers in training set
Top 13%
2.4%
11
Cell Genomics
162 papers in training set
Top 2%
2.1%
12
Cancer Research
116 papers in training set
Top 1%
2.1%
13
Nature Biotechnology
147 papers in training set
Top 4%
2.1%
14
Nucleic Acids Research
1128 papers in training set
Top 9%
1.9%
15
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 30%
1.8%
16
Bioinformatics
1061 papers in training set
Top 7%
1.8%
17
Communications Biology
886 papers in training set
Top 11%
1.5%
18
Nature Chemical Biology
104 papers in training set
Top 2%
1.3%
19
npj Systems Biology and Applications
99 papers in training set
Top 2%
1.2%
20
Nature Genetics
240 papers in training set
Top 6%
1.0%
21
Patterns
70 papers in training set
Top 2%
1.0%
22
Science Advances
1098 papers in training set
Top 26%
0.9%
23
Science
429 papers in training set
Top 18%
0.9%
24
PLOS ONE
4510 papers in training set
Top 66%
0.8%
25
Scientific Reports
3102 papers in training set
Top 73%
0.8%
26
Cancer Cell
38 papers in training set
Top 2%
0.8%
27
npj Digital Medicine
97 papers in training set
Top 3%
0.8%
28
Nature
575 papers in training set
Top 15%
0.8%
29
Genome Biology
555 papers in training set
Top 8%
0.7%
30
eLife
5422 papers in training set
Top 61%
0.6%