Back

Integrating GWAS and Transcriptome Data through PrediXcan and Multimodal Deep Learning to Reveal Genetic Basis and Novel Drug Repositioning Opportunities for Alzheimer's Disease

Tian, X.; Su, Y.; Zhang, S.; Sun, H.; Chen, C.; Chen, C.; Zhou, L.; Lv, X.; Roussos, P.; Zhang, W.

2025-01-02 genetic and genomic medicine
10.1101/2025.01.02.25319880 medRxiv
Show abstract

Alzheimers disease (AD), the leading cause of dementia, imposes a significant societal and economic burden; however, its complex molecular mechanisms remain unclear. This study integrates multi-omics data with advanced artificial intelligence (AI) methods to uncover the molecular basis underlying AD phenotype regulation and explore personalized drug repositioning strategies based on individual genetic backgrounds. First, we applied the PrediXcan method to identify candidate genes closely associated with AD cognitive diagnosis, selecting from 61 brain-related traits. We validated these findings through individual-level analysis using gene expression and genotype data from 553 dorsolateral prefrontal cortex samples in the ROSMAP database. Simultaneously, we constructed a deep, multi-layer information fusion model (AD-MIF) by integrating genotype and gene expression data and employing autoencoders as well as graph autoencoders for multi-modal feature extraction. The results revealed a 10-20% improvement in the Area Under the Curve (AUC) for predicting AD-related phenotypes. Both approaches showed high consistency across cellular structures, brain regions, and neurobiological pathways, demonstrating their complementary advantages. Gene enrichment analysis indicated that APOE and its interacting gene APOC1 play a central role in cholesterol metabolism, lipid transport, and immune regulation, while genes such as SCIMP and KAT8 are involved in immune signaling, epigenetic regulation, and neuroprotection. After incorporating attention mechanisms, AD-MIF highlighted the importance of key genes, such as POLR2C and TRAPPC4, in regulating neuronal function. Based on predictive results and enrichment analysis, we further identified candidate drugs, including sirolimus, dasatinib, and MGCD-265. In vivo experiments confirmed that MGCD-265, also known as Glesatinib, and dasatinib significantly improve cognitive deficits in the SAMP8 AD model mice by inhibiting neuroinflammation, pathological tau phosphorylation, and A{beta} deposition. This study demonstrates the complementary advantages of bioinformatics pipelines and AI-based multi-modal fusion methods in elucidating the complex pathological mechanisms of AD and enhancing phenotype prediction accuracy. It also provides new theoretical support for personalized drug interventions based on individual genetic characteristics, laying a solid foundation for optimizing early screening, prediction, and personalized treatment strategies. Graphical Abstract O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=113 SRC="FIGDIR/small/25319880v2_ufig1.gif" ALT="Figure 1"> View larger version (43K): org.highwire.dtl.DTLVardef@156b7c1org.highwire.dtl.DTLVardef@3a1e23org.highwire.dtl.DTLVardef@190a284org.highwire.dtl.DTLVardef@9d963f_HPS_FORMAT_FIGEXP M_FIG C_FIG

Matching journals

The top 13 journals account for 50% of the predicted probability mass.

1
Alzheimer's Research & Therapy
52 papers in training set
Top 0.1%
10.4%
2
Briefings in Bioinformatics
326 papers in training set
Top 0.6%
7.4%
3
Alzheimer's & Dementia
143 papers in training set
Top 1%
4.4%
4
Advanced Science
249 papers in training set
Top 4%
4.1%
5
Computers in Biology and Medicine
120 papers in training set
Top 0.8%
3.7%
6
Frontiers in Aging Neuroscience
67 papers in training set
Top 1%
3.2%
7
iScience
1063 papers in training set
Top 7%
3.0%
8
eLife
5422 papers in training set
Top 31%
2.8%
9
Nature Communications
4913 papers in training set
Top 44%
2.7%
10
Signal Transduction and Targeted Therapy
29 papers in training set
Top 0.5%
2.4%
11
Biosensors and Bioelectronics
52 papers in training set
Top 0.5%
2.4%
12
International Journal of Molecular Sciences
453 papers in training set
Top 5%
2.1%
13
Communications Biology
886 papers in training set
Top 5%
2.1%
50% of probability mass above
14
Aging Cell
144 papers in training set
Top 2%
2.1%
15
Cell & Bioscience
14 papers in training set
Top 0.1%
1.9%
16
Artificial Intelligence in the Life Sciences
11 papers in training set
Top 0.1%
1.8%
17
Scientific Reports
3102 papers in training set
Top 61%
1.5%
18
Frontiers in Genetics
197 papers in training set
Top 5%
1.5%
19
Heliyon
146 papers in training set
Top 3%
1.4%
20
Cell Proliferation
12 papers in training set
Top 0.2%
1.1%
21
Biomedicines
66 papers in training set
Top 2%
1.1%
22
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 5%
1.1%
23
Molecular Psychiatry
242 papers in training set
Top 3%
1.0%
24
Genomics
60 papers in training set
Top 2%
1.0%
25
National Science Review
22 papers in training set
Top 1%
1.0%
26
Computational and Structural Biotechnology Journal
216 papers in training set
Top 7%
0.9%
27
Journal of Alzheimer’s Disease
39 papers in training set
Top 0.9%
0.9%
28
npj Aging
15 papers in training set
Top 0.7%
0.9%
29
Cell Death & Disease
126 papers in training set
Top 2%
0.8%
30
NeuroImage
813 papers in training set
Top 5%
0.8%