Back

Disentangling the contribution of disease genes to drug therapeutic and side effects

Lalagkas, P. N.; Melamed, R. D.

2026-05-05 genetic and genomic medicine
10.64898/2026.05.04.26352378 medRxiv
Show abstract

Most clinical trials fail due to either lack of efficacy or safety concerns. Human genetics can address both failure reasons: disease-associated genes are not only promising therapeutic targets but also predict drug side effects. However, because the same genetic signal underlies both outcomes, we need methods that disentangle which disease genes mediate therapeutic benefit versus adverse side effects. We use DraphNet, our previously developed model that maps drug molecular effects onto disease genes to generate two gene sets per drug: one linked to its therapeutic effects (IND genes) and one linked to its side effects (SE genes). We show that IND and SE genes overlap for 76% of the tested drugs (compared to a null model). We also show that drugs sharing greater IND similarity also have greater SE similarity ({rho}=0.57, p<1e-300). To show how our approach enables insights into drug biology, we construct groupings of drugs based on their IND and SE genes. We find that drugs in the same IND grouping are enriched for co-occurrence in the same SE grouping (OR=212.37). We present two examples to illustrate the kind of insights this network enables: identification of drugs with shared IND but distinct SE genes as repurposing candidates, and identification of drugs with shared SE but distinct IND genes to assist treatment selection in patients with comorbidities. Finally, we develop a neural network that directly links drug molecular effects onto disease genes and learns a gene-level score that quantifies each genes relative contribution to drug therapeutic versus side effects on disease.

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 3%
10.2%
2
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.3%
10.2%
3
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 10%
6.9%
4
Cell Systems
167 papers in training set
Top 2%
4.9%
5
PLOS Computational Biology
1633 papers in training set
Top 7%
4.9%
6
Scientific Reports
3102 papers in training set
Top 27%
4.4%
7
npj Digital Medicine
97 papers in training set
Top 1%
4.0%
8
Nature Human Behaviour
85 papers in training set
Top 0.9%
3.6%
9
Nature Genetics
240 papers in training set
Top 2%
3.6%
50% of probability mass above
10
eLife
5422 papers in training set
Top 25%
3.6%
11
Genome Medicine
154 papers in training set
Top 2%
3.6%
12
Bioinformatics Advances
184 papers in training set
Top 2%
2.8%
13
Nature Communications
4913 papers in training set
Top 45%
2.6%
14
Communications Medicine
85 papers in training set
Top 0.1%
2.1%
15
The American Journal of Human Genetics
206 papers in training set
Top 2%
1.9%
16
iScience
1063 papers in training set
Top 14%
1.7%
17
PLOS ONE
4510 papers in training set
Top 53%
1.7%
18
Artificial Intelligence in the Life Sciences
11 papers in training set
Top 0.1%
1.5%
19
Cell Genomics
162 papers in training set
Top 4%
1.3%
20
BMC Medical Genomics
36 papers in training set
Top 0.6%
1.3%
21
Nature Medicine
117 papers in training set
Top 3%
1.2%
22
Frontiers in Pharmacology
100 papers in training set
Top 3%
1.2%
23
Frontiers in Molecular Biosciences
100 papers in training set
Top 4%
0.9%
24
Journal of Clinical Investigation
164 papers in training set
Top 6%
0.8%
25
Science
429 papers in training set
Top 19%
0.8%
26
Briefings in Bioinformatics
326 papers in training set
Top 6%
0.8%
27
Nature
575 papers in training set
Top 16%
0.8%
28
Evolution
199 papers in training set
Top 2%
0.7%
29
Molecular Systems Biology
142 papers in training set
Top 2%
0.7%
30
Cancer Research
116 papers in training set
Top 4%
0.5%