Back

Long-read sequencing reveals diverse haplotypes and common structural variants in Alzheimer's Disease GWAS loci

Tesi, N.; Salazar, A.; Bouland, G.; Alvarez Sirvent, D.; Zhang, Y.; Knoop, L.; van Schoor, N. M.; Huisman, M.; Wijesekera, S.; Krizova, J.; Tijms, B.; Vijverberg, E.; ADGC, Bonn, CHARGE, EADB, EADI, FinnGen, GERAD, GR@ACE/DEGESCO, PGC-ALZ, ; Hulsman, M.; van der Lee, S. J.; Reinders, M.; Holstege, H.

2026-03-17 genetic and genomic medicine
10.64898/2026.03.16.26348487 medRxiv
Show abstract

Genome-wide association studies (GWAS) have identified over 100 Single Nucleotide Polymorphisms (SNPs) associated with Alzheimers disease (AD) risk, however, most signals tag haplotypes rather than causal variants. This highlights the need to characterize haplotype-specific variation, including structural variants (SVs) and epigenetic modifications, as these may play a central role in shaping downstream disease mechanisms. We applied linkage disequilibrium (LD)-based clumping, followed by conditional analysis to identify significant and independent haplotypes associated with AD. Through long-read sequencing of 493 individuals, we systematically characterized the SV and DNA methylation landscape of these haplotypes. We integrated allele-specific differential methylation and chromatin organization to prioritize SVs likely contributing to disease mechanisms. Finally, we explored the feasibility of imputation approaches to predict SV size in 5,936 array-genotyped individuals. Using AD-GWAS summary statistics for 98 GWAS loci we identified 280 independent and significant haplotypes. We then identified 2,000 unique SVs that were in LD (R{superscript 2}>0.15) with 207/280 haplotypes. These SVs were predominantly composed of intronic transposable elements and tandem repeats, largely multi-allelic and overlapping regulatory regions. Based on differential methylation, genomic and chromatin co-localization, we prioritized 52 SVs as candidate contributors to disease mechanisms: 14 of these were in high LD with AD-haplotypes (R{superscript 2}>0.8), 12 were in moderate LD (R{superscript 2}>0.5), and 26 were in low LD (R{superscript 2}>0.15). We identified intronic SVs in TMEM106B, CYSTM1, IPMK, LMAN2, MINDY2, as well as likely regulatory and exonic SVs in APP, NDUFS2, TMEM184A, STRN4, CNN2, ADAM10, and other loci. Fine mapping of the PLEC/SHARPIN locus revealed a novel haplotype with a tandem repeat expansion driving enhancer methylation and reduced PLEC expression in microglia. Finally, we imputed 83% of SVs with high accuracy (N=1,651, mean R{superscript 2}=0.76), and association with AD status of imputed SVs yielded 112 significant associations (FDR<0.05). AD risk loci are genetically complex, often comprising multiple haplotypes and linked SVs that could contribute to disease mechanisms. Integrating long-read sequencing, epigenetic data, and imputation strategies provides a more nuanced view of AD genetic architecture and highlights SVs as potential drivers of disease risk.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Nature Genetics
240 papers in training set
Top 0.1%
22.9%
2
The American Journal of Human Genetics
206 papers in training set
Top 0.2%
15.0%
3
Nature Communications
4913 papers in training set
Top 22%
8.6%
4
Alzheimer's & Dementia
143 papers in training set
Top 1%
4.9%
50% of probability mass above
5
Genome Medicine
154 papers in training set
Top 1%
4.9%
6
Brain
154 papers in training set
Top 1%
4.0%
7
Neuron
282 papers in training set
Top 4%
3.7%
8
Cell Genomics
162 papers in training set
Top 2%
3.1%
9
Nature Neuroscience
216 papers in training set
Top 3%
2.8%
10
Science
429 papers in training set
Top 12%
2.1%
11
Cell
370 papers in training set
Top 10%
1.9%
12
Human Genetics and Genomics Advances
70 papers in training set
Top 0.2%
1.8%
13
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 32%
1.7%
14
Nature Aging
51 papers in training set
Top 1%
1.5%
15
Neurobiology of Aging
95 papers in training set
Top 2%
1.2%
16
PLOS Genetics
756 papers in training set
Top 12%
1.0%
17
eLife
5422 papers in training set
Top 51%
1.0%
18
Molecular Psychiatry
242 papers in training set
Top 3%
0.9%
19
Journal of Experimental Medicine
106 papers in training set
Top 3%
0.8%
20
Human Molecular Genetics
130 papers in training set
Top 4%
0.7%
21
Science Advances
1098 papers in training set
Top 32%
0.7%
22
Nature Structural & Molecular Biology
218 papers in training set
Top 5%
0.7%
23
Molecular Neurodegeneration
49 papers in training set
Top 1%
0.7%
24
Nature
575 papers in training set
Top 17%
0.7%
25
Aging Cell
144 papers in training set
Top 4%
0.5%
26
Nature Medicine
117 papers in training set
Top 6%
0.5%
27
Cell Systems
167 papers in training set
Top 14%
0.5%
28
Annals of Neurology
57 papers in training set
Top 3%
0.5%
29
Scientific Reports
3102 papers in training set
Top 80%
0.5%
30
Immunity
58 papers in training set
Top 5%
0.5%