Back

An Explainable Deep Learning Framework for Imaging Genetics: Deriving Brain-Genotype Scores From MRI to Link Genetic Variation, Brain Structure, and Cognition

Alhasani, K. T.; Ghose, U.; Sammet, J.; Zhu, T.; Xiao, S.; Hastoy, B.; Brennan, P.; froud, K.; Ulm, B.; Duijn, C. v.; Winchester, L. M.; Marsden, B. D.; Nevado-Holgado, A.

2026-05-08 radiology and imaging
10.64898/2026.05.06.26352595 medRxiv
Show abstract

Imaging genetics aims to understand how genetic variation influences brain structure and cognitive function. Traditional approaches often rely on imaging-derived phenotypes (IDPs), which require high-dimensional brain images to be reduced to predefined summary measures and may therefore miss subtle or spatially distributed genotype-related effects. We developed a two-stage framework that integrates deep learning and statistical modelling to derive and exploit brain-genotype scores--continuous, image-based representations of genetic variation learned directly from structural MRI. In the first stage, we trained a multi-task 3D convolutional neural network (CNN) on T1-weighted MRI scans from the UK Biobank, a large, population-based cohort, to predict single-nucleotide polymorphism (SNP) variation, producing brain-genotype scores that capture distributed neuroanatomical patterns associated with specific genetic variants. Unlike conventional IDPs, these scores are learned directly from raw images and are designed to encode genotype-related brain structure without reliance on predefined regional features. Gradient-based saliency maps were used to localise neuroanatomical regions contributing to each score, providing interpretable links between genetic variation and brain anatomy. In the second stage, brain-genotype scores derived from the held-out test set were used as quantitative neuroanatomical markers in association analyses with cognitive performance. These scores showed robust, Bonferroni-corrected associations with multiple cognitive measures, including fluid intelligence, reaction time, and memory performance. In contrast, traditional machine learning models trained on IDPs failed to generate comparably in-formative scores. This integrated framework demonstrates that brain-genotype scores provide a flexible and interpretable representation of genotype-related neuroanatomical variation, enabling the discovery of biologically meaningful links between genetic variation, brain structure, and cognition that are difficult to detect using traditional imaging genetic approaches.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 10%
14.6%
2
Science Translational Medicine
111 papers in training set
Top 0.1%
10.0%
3
Human Brain Mapping
295 papers in training set
Top 0.8%
8.4%
4
NeuroImage
813 papers in training set
Top 2%
6.3%
5
Nature Computational Science
50 papers in training set
Top 0.1%
4.3%
6
Nature Machine Intelligence
61 papers in training set
Top 0.7%
4.3%
7
Scientific Reports
3102 papers in training set
Top 34%
3.7%
50% of probability mass above
8
Communications Biology
886 papers in training set
Top 2%
3.2%
9
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 23%
3.1%
10
Nature
575 papers in training set
Top 8%
2.9%
11
Science Advances
1098 papers in training set
Top 10%
2.7%
12
NeuroImage: Clinical
132 papers in training set
Top 2%
2.6%
13
eBioMedicine
130 papers in training set
Top 0.6%
2.6%
14
eLife
5422 papers in training set
Top 38%
1.9%
15
Nature Medicine
117 papers in training set
Top 2%
1.9%
16
Imaging Neuroscience
242 papers in training set
Top 2%
1.9%
17
Nature Methods
336 papers in training set
Top 4%
1.7%
18
Nature Neuroscience
216 papers in training set
Top 4%
1.7%
19
Patterns
70 papers in training set
Top 1%
1.5%
20
The American Journal of Human Genetics
206 papers in training set
Top 2%
1.5%
21
Biological Psychiatry
119 papers in training set
Top 2%
1.2%
22
Brain Communications
147 papers in training set
Top 2%
1.2%
23
Neuron
282 papers in training set
Top 7%
1.1%
24
Nature Genetics
240 papers in training set
Top 6%
0.9%
25
Science
429 papers in training set
Top 18%
0.9%
26
Genome Medicine
154 papers in training set
Top 9%
0.7%
27
Cerebral Cortex
357 papers in training set
Top 2%
0.7%
28
Cell
370 papers in training set
Top 19%
0.6%
29
iScience
1063 papers in training set
Top 37%
0.6%