Back

Normative Speech Modeling for ALS Diagnosis with Application to Other Neurodegenerative Diseases

Shah, M.

2026-05-27 neurology
10.64898/2026.05.25.26354057 medRxiv
Show abstract

Amyotrophic lateral sclerosis (ALS) is a progressive neurodegenerative disease affecting more than 450,000 individuals worldwide and is frequently diagnosed more than 12 months after symptom onset, delaying intervention during a critical early window. Because up to 80% of patients develop dysarthria within two years, subtle changes in speech provide a signal of early bulbar motor neuron degeneration. However, existing speech-based systems rely on supervised classification trained on limited datasets, achieving moderate sensitivity and depending heavily on labeled disease examples, which restrict scalability and early detection. This study introduces SPEAK-NORM, the first-ever normative speech modeling framework for early ALS diagnosis, which learns age- and sex-conditioned motor-speech distributions exclusively from healthy individuals. A conditional variational autoencoder models coordination of hypoglossal, laryngeal, and respiratory motor pathways, and deviation from this healthy manifold is quantified through latent representations and reconstruction error to form a 354-dimensional profile. A calibrated linear Support Vector Machine performs subject-level classification under subject-disjoint validation. On the VOC-ALS database (n = 153), SPEAK-NORM achieves 98% accuracy with balanced sensitivity and specificity, significantly outperforming established clinical acoustic indices and prior systems. The framework maintains strong performance under cross-task generalization and when retrained on healthy controls in independent dementia and Parkinson disease cohorts, demonstrating disease-specific deviation patterns rather than generic neurodegenerative change. Spectral, temporal, and latent separations further support interpretability. By modeling healthy speech instead of memorizing disease examples, SPEAK-NORM enables scalable early neuromotor screening using recording devices, with potential to support earlier diagnosis, differential classification, and monitoring of ALS progression.

Matching journals

The top 10 journals account for 50% of the predicted probability mass.

1
Advanced Science
249 papers in training set
Top 2%
8.4%
2
IEEE Transactions on Biomedical Engineering
38 papers in training set
Top 0.1%
6.3%
3
Scientific Reports
3102 papers in training set
Top 18%
6.3%
4
Nature Communications
4913 papers in training set
Top 30%
6.3%
5
Nature Biomedical Engineering
42 papers in training set
Top 0.2%
4.8%
6
Nature Computational Science
50 papers in training set
Top 0.1%
4.8%
7
Imaging Neuroscience
242 papers in training set
Top 0.9%
3.9%
8
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 18%
3.9%
9
Brain
154 papers in training set
Top 2%
3.6%
10
eBioMedicine
130 papers in training set
Top 0.4%
3.6%
50% of probability mass above
11
Nature Medicine
117 papers in training set
Top 0.9%
3.6%
12
Communications Biology
886 papers in training set
Top 2%
3.6%
13
Journal of Neural Engineering
197 papers in training set
Top 0.8%
3.0%
14
Med
38 papers in training set
Top 0.2%
1.9%
15
Scientific Data
174 papers in training set
Top 0.9%
1.9%
16
Nature Machine Intelligence
61 papers in training set
Top 2%
1.9%
17
Nucleic Acids Research
1128 papers in training set
Top 11%
1.7%
18
Nature Neuroscience
216 papers in training set
Top 4%
1.7%
19
Computers in Biology and Medicine
120 papers in training set
Top 2%
1.7%
20
Human Brain Mapping
295 papers in training set
Top 3%
1.7%
21
Brain Communications
147 papers in training set
Top 2%
1.5%
22
PLOS ONE
4510 papers in training set
Top 56%
1.5%
23
NeuroImage
813 papers in training set
Top 5%
1.3%
24
npj Digital Medicine
97 papers in training set
Top 3%
1.3%
25
eLife
5422 papers in training set
Top 52%
0.9%
26
NeuroImage: Clinical
132 papers in training set
Top 3%
0.9%
27
Frontiers in Digital Health
20 papers in training set
Top 1%
0.9%
28
Frontiers in Neuroscience
223 papers in training set
Top 6%
0.9%
29
iScience
1063 papers in training set
Top 29%
0.8%
30
Nature
575 papers in training set
Top 16%
0.7%