Back

Multi-Stage Graph Attention Networks for Interpretable Alzheimer's Disease Classification from Genome-Wide Association Data

Saxena, A.; Gaiteri, C.; Faraone, S. V.

2026-04-09 neuroscience
10.64898/2026.04.06.716790 bioRxiv
Show abstract

BackgroundGenome-wide association studies have identified numerous variants associated with neuropsychiatric disorders. Although some significant loci can carry substantial risk, as in Alzheimers Disease, the remaining genetic variance is distributed across many small-effect loci. Polygenic risk scores (PRS) aggregate this risk but do not capture epistatic interactions, and offer limited biological interpretability and predictive accuracy. Computing gene level risk scores and integrating known or statistically validated gene-gene associations has the potential to increase interpretability and/or accuracy. Graph Neural Networks (GNNs) can leverage graph structured genetic data that models potential epistatic interactions to achieve these goals. MethodsWe developed a three-stage Graph Attention Network (GAT) classifier using individual-level GWAS data from 7,358 participants across seven Alzheimers Disease Center cohorts. Nodes were defined as genes, with risk scores from AD and 11 genetically correlated phenotypes serving as features. We evaluated two graph construction strategies: gene co-expression networks derived from hippocampal transcriptomic data and curated pathway-based graphs. Additionally, a bilinear context module was incorporated to capture global gene-gene interactions beyond the graph topology. In Stage 1, a GNN encoder was trained on the graphs; Stage 2 injected PRS for non-coding SNPs after the encoder to better capture genetic risk via transfer learning, and Stage 3 applied adversarial training with gradient reversal for ancestry debiasing. GNN predictions were ensembled with whole-genome PRS using elastic net regression. ResultsThe best-performing GNN model -- a GAT with bilinear context operating on the pathway graph -- achieved an AUROC of 0.78 (95% CI: 0.75-0.80). Ensemble models combining Stage 2 or 3 GNN logits with whole-genome PRS achieved an AUROC of 0.82 (0.79-0.84), outperforming PRS alone (0.80). GxI attribution and additional explainability analyses revealed stage-specific biological signals, some of which re-capitulated known gene-phenotype associations and others which may reflect potential new areas of inquiry. ConclusionA multi-stage GAT framework captures complementary, non-additive genetic signal that, when ensembled with PRS, improves the accuracy of AD classification. Post-hoc explainability analyses yield biologically interpretable gene networks, supporting the utility of graph-based deep learning for dissecting complex genetic architectures.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Alzheimer's & Dementia
143 papers in training set
Top 0.1%
40.8%
2
Alzheimer's & Dementia: Diagnosis, Assessment & Disease Monitoring
38 papers in training set
Top 0.2%
6.6%
3
PLOS Computational Biology
1633 papers in training set
Top 8%
4.1%
50% of probability mass above
4
Scientific Reports
3102 papers in training set
Top 33%
3.7%
5
Bioinformatics
1061 papers in training set
Top 7%
2.0%
6
Alzheimer's & Dementia: Translational Research & Clinical Interventions
16 papers in training set
Top 0.3%
2.0%
7
Frontiers in Aging Neuroscience
67 papers in training set
Top 2%
1.8%
8
Human Brain Mapping
295 papers in training set
Top 3%
1.7%
9
Nature Communications
4913 papers in training set
Top 51%
1.7%
10
Brain Communications
147 papers in training set
Top 2%
1.5%
11
Journal of Alzheimer’s Disease
39 papers in training set
Top 0.7%
1.5%
12
Brain
154 papers in training set
Top 3%
1.4%
13
PLOS ONE
4510 papers in training set
Top 59%
1.3%
14
Alzheimer's Research & Therapy
52 papers in training set
Top 1%
1.3%
15
Molecular Psychiatry
242 papers in training set
Top 3%
1.1%
16
Annals of Neurology
57 papers in training set
Top 2%
1.0%
17
eBioMedicine
130 papers in training set
Top 3%
0.9%
18
Translational Psychiatry
219 papers in training set
Top 4%
0.9%
19
Nature Neuroscience
216 papers in training set
Top 5%
0.9%
20
Communications Biology
886 papers in training set
Top 20%
0.8%
21
BMC Bioinformatics
383 papers in training set
Top 6%
0.8%
22
NeuroImage: Clinical
132 papers in training set
Top 3%
0.8%
23
Biological Psychiatry
119 papers in training set
Top 2%
0.8%
24
Advanced Science
249 papers in training set
Top 18%
0.8%
25
GeroScience
97 papers in training set
Top 1%
0.8%
26
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 43%
0.8%
27
iScience
1063 papers in training set
Top 30%
0.8%
28
npj Digital Medicine
97 papers in training set
Top 3%
0.8%
29
Neurobiology of Disease
134 papers in training set
Top 4%
0.7%
30
Computers in Biology and Medicine
120 papers in training set
Top 5%
0.7%