Back

DeepDrug2: A Germline-focused Graph Neural Network Framework for Alzheimer's Drug Repurposing Validated by Electronic Health Records

Li, V. O. K.; Han, Y.; Lam, J. C. K.; Downey, J.

2025-05-14 health informatics
10.1101/2025.05.13.25327570 medRxiv
Show abstract

Alzheimers disease (AD) is a complex neurodegenerative disorder with limited therapeutic options. The original DeepDrug framework by Li et al. (2025) relied on somatic mutation data and emphasized long genes to guide AD drug repurposing. However, emerging evidence suggests that germline genetic variants play a more central role in AD pathogenesis. In response, we develop DeepDrug2, an enhanced AI-driven framework for AD drug repurposing centered on germline mutations and validated using real-world electronic health records (EHRs). DeepDrug2 introduces four major innovations. First, it proposes a different hypothesis prioritizing germline over somatic mutations in influencing AD risk. Second, it updates the signed directed heterogeneous biomedical graph by removing somatic mutations, long genes, and expert-led genes from the previous version, and incorporating new genes identified in recent genome-wide association study (GWAS) findings. Third, it generates a new list of drug candidates by encoding this updated graph into a new embedding space via a graph neural network (GNN) and calculating drug-gene scores. Fourth, it performs real-world clinical validation using EHR data from over 500,000 individuals (including more than 4,000 AD cases) in the UK Biobank, evaluating associations between drug usage and AD onset while controlling for demographic and comorbidity factors. DeepDrug2 has identified several promising drug candidates. Among the top 15 candidates with sufficient medication records to support statistically powered analysis, Amlodipine (a calcium channel blocker), Indapamide (a thiazide-like diuretic), and Atorvastatin (a statin) were significantly associated with reduced AD risk (p < 0.05). These findings highlight the role of germline mutations in guiding AD drug repurposing and emphasize the value of integrating real-world clinical data into AI-driven drug discovery. To further validate these candidates, future work will involve experimental studies using mouse and zebrafish models of AD. DeepDrug2 offers a promising strategy to support future clinical studies and expand therapeutic options for AD. Future work will evolve DeepDrug2 into a more powerful, versatile, and precise tool for AI-driven drug repurposing in neurodegenerative diseases by deeply integrating advanced LLM capabilities, prioritizing critical disease mechanisms, including tau pathology, and holistically incorporating multi-modal data sources.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 2%
14.9%
2
Frontiers in Aging Neuroscience
67 papers in training set
Top 0.3%
10.2%
3
Artificial Intelligence in the Life Sciences
11 papers in training set
Top 0.1%
9.3%
4
Alzheimer's Research & Therapy
52 papers in training set
Top 0.4%
4.9%
5
npj Digital Medicine
97 papers in training set
Top 0.9%
4.9%
6
The Journal of Prevention of Alzheimer's Disease
10 papers in training set
Top 0.1%
3.6%
7
Neurobiology of Aging
95 papers in training set
Top 0.8%
3.6%
50% of probability mass above
8
Patterns
70 papers in training set
Top 0.2%
3.6%
9
Briefings in Bioinformatics
326 papers in training set
Top 3%
2.1%
10
Scientific Reports
3102 papers in training set
Top 50%
2.1%
11
NeuroImage: Clinical
132 papers in training set
Top 2%
1.9%
12
Computers in Biology and Medicine
120 papers in training set
Top 2%
1.8%
13
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 1%
1.7%
14
GeroScience
97 papers in training set
Top 0.9%
1.7%
15
Nature Communications
4913 papers in training set
Top 51%
1.7%
16
Computational and Structural Biotechnology Journal
216 papers in training set
Top 5%
1.3%
17
Journal of the American Medical Informatics Association
61 papers in training set
Top 2%
1.2%
18
Annals of Neurology
57 papers in training set
Top 1%
1.2%
19
Experimental Neurology
57 papers in training set
Top 0.9%
1.2%
20
Communications Medicine
85 papers in training set
Top 0.5%
1.2%
21
Journal of Personalized Medicine
28 papers in training set
Top 0.8%
1.0%
22
Frontiers in Artificial Intelligence
18 papers in training set
Top 0.5%
1.0%
23
PLOS ONE
4510 papers in training set
Top 62%
1.0%
24
Alzheimer's & Dementia
143 papers in training set
Top 2%
0.9%
25
Human Brain Mapping
295 papers in training set
Top 4%
0.8%
26
PLOS Computational Biology
1633 papers in training set
Top 23%
0.8%
27
Alzheimer's & Dementia: Translational Research & Clinical Interventions
16 papers in training set
Top 0.6%
0.8%
28
Bioinformatics Advances
184 papers in training set
Top 4%
0.8%
29
Brain Communications
147 papers in training set
Top 3%
0.8%
30
iScience
1063 papers in training set
Top 31%
0.8%