Back

Characterising the motif composition and allele length distribution of ZFHX3 GGC repeat expansions in amyotrophic lateral sclerosis

Zussa, Z. N.; Smith, A. N.; van Vugt, J. J. F. A.; O'Shaughnessy, D. S.; Grima, N.; Chan Moi Fat, S.; Blair, I. P.; Rowe, D. B.; Pamphlett, R.; Nicolson, G. A.; Kiernan, M. C.; van Rheenen, W.; Veldink, J.; Project MinE ALS sequencing consortium, ; Williams, K. L.; Henden, L.

2026-03-10 genetic and genomic medicine
10.64898/2026.03.09.26347973 medRxiv
Show abstract

Background and objectivesA pathogenic GGC repeat expansion in the zinc finger homeobox 3 (ZFHX3) gene, encoding a pure polyglycine tract, is the cause of spinocerebellar ataxia type 4 (SCA4). Intermediate expansions of other SCA loci contribute to the risk of amyotrophic lateral sclerosis (ALS), a fatal neurodegenerative disease involving the progressive loss of motor neurons. There is increasing awareness of the role of short tandem repeat (STR) motif composition and configuration in disease pathogenicity. Given the genetic pleiotropy between ALS and SCA, this study aimed to evaluate whether ZFHX3 GGC expansions were associated with ALS and to characterise repeat motif composition. MethodsExpansionHunter v5 was used to genotype ZFHX3 GGC repeat sizes in short-read whole genome sequencing data from people with ALS and healthy controls of European ancestry. Repeat sizes were visually inspected using REViewer v2. Repeat motif configurations of Australian ALS cases were manually derived from REViewer images. Receiver operating characteristic (ROC) curve analysis and Youdens J statistic were performed to find a candidate repeat size threshold for association testing using Fishers exact test. ResultsAnalysis of 5,785 people with ALS and 7,982 healthy controls found no association between ZFHX3 GGC repeat expansions and disease risk. However, more than 30 unique repeat motif compositions were identified across 802 people with ALS. Of these, seven distinct configurations coded a pure polyglycine tract which, when expanded, is canonical to SCA4. DiscussionAlthough no association was observed between ZFHX3 GGC repeat expansions and ALS, this study established the dynamic nature of ZFHX3 repeat motif composition and configuration. Unique motif compositions were identified both within and between repeat sizes, including the presence of pure polyglycine repeats in ALS. Consideration of repeat motif composition and configuration, in addition to repeat allele length, may be important for assessing neurodegenerative disease risk.

Matching journals

The top 13 journals account for 50% of the predicted probability mass.

1
Muscle & Nerve
10 papers in training set
Top 0.1%
8.5%
2
Scientific Reports
3102 papers in training set
Top 14%
6.9%
3
Human Mutation
29 papers in training set
Top 0.1%
6.9%
4
Movement Disorders
62 papers in training set
Top 0.5%
3.6%
5
Annals of Clinical and Translational Neurology
29 papers in training set
Top 0.2%
3.6%
6
Neurobiology of Disease
134 papers in training set
Top 1%
3.6%
7
PLOS ONE
4510 papers in training set
Top 42%
3.1%
8
Journal of Neurology
26 papers in training set
Top 0.3%
2.8%
9
Annals of Neurology
57 papers in training set
Top 0.7%
2.6%
10
Orphanet Journal of Rare Diseases
18 papers in training set
Top 0.2%
2.4%
11
Brain Communications
147 papers in training set
Top 1%
2.4%
12
Frontiers in Neurology
91 papers in training set
Top 2%
2.1%
13
Neurology Genetics
14 papers in training set
Top 0.1%
1.9%
50% of probability mass above
14
Genetics in Medicine
69 papers in training set
Top 0.6%
1.9%
15
European Journal of Neurology
20 papers in training set
Top 0.2%
1.9%
16
npj Genomic Medicine
33 papers in training set
Top 0.3%
1.8%
17
Human Molecular Genetics
130 papers in training set
Top 2%
1.8%
18
Journal of Neurology, Neurosurgery & Psychiatry
29 papers in training set
Top 0.6%
1.7%
19
Journal of the Neurological Sciences
17 papers in training set
Top 0.3%
1.7%
20
Frontiers in Genetics
197 papers in training set
Top 6%
1.5%
21
Wellcome Open Research
57 papers in training set
Top 1%
1.3%
22
BMC Genomics
328 papers in training set
Top 3%
1.2%
23
Alzheimer's & Dementia
143 papers in training set
Top 2%
1.2%
24
EBioMedicine
39 papers in training set
Top 0.5%
1.2%
25
Bioinformatics
1061 papers in training set
Top 8%
1.2%
26
Brain
154 papers in training set
Top 3%
1.2%
27
Neurology
44 papers in training set
Top 1%
1.1%
28
Neuropathology and Applied Neurobiology
14 papers in training set
Top 0.4%
1.0%
29
Frontiers in Cellular Neuroscience
79 papers in training set
Top 0.9%
1.0%
30
Molecular and Cellular Neuroscience
18 papers in training set
Top 0.4%
1.0%