Back

AI-based Speech Error Detection to Differentiate Primary Progressive Aphasia Variants

Vonk, J. M. J.; Lian, J.; Cho, C. J.; Antonicelli, G.; Ezzes, Z.; Wauters, L. D.; Keegan-Rodewald, W.; Kurteff, G. L.; Rodriguez, D. A.; Dronkers, N.; Henry, M. L.; Miller, Z. A.; Mandelli, M. L.; Anumanchipalli, G. K.; Gorno-Tempini, M. L.

2026-02-24 neurology
10.64898/2026.02.23.26346899 medRxiv
Show abstract

BackgroundArtificial Intelligence (AI) based approaches to speech analysis have the potential to assist with objective speech error analysis in aphasia but off-the shelf tools often fail to detect speech errors due to prioritizing "fluent transcription." Speech production errors (dysfluencies) are hallmark diagnostic features of the nonfluent (nfvPPA) and logopenic (lvPPA) variants of primary progressive aphasia, yet they can be challenging to detect and characterize even by expert clinicians. This study aimed to evaluate whether the novel automated lightweight Scalable Speech Dysfluency Modeling system (SSDM-L), specifically designed to detect dysfluencies, could accurately distinguish PPA variants using voice recordings of individuals reading a brief passage. MethodParticipants included a total of 104 individuals, 40 with nfvPPA, 40 with lvPPA (matched on disease severity), and 24 healthy controls who read aloud the Grandfather Passage as part of a widely used motor speech evaluation (MSE). We automatically extracted ten speech error (dysfluency) variables using SSDM-L, including insertions, replacements, and deletions at both phoneme- and word-levels, and phoneme-level prolongations and repetitions. Group differences were assessed via ANCOVAs controlling for age, education, and disease severity (MMSE, CDR sum-of-boxes). To test clinical relevance, we performed correlation analyses with MSE ratings provided by experienced speech-language pathologists (i.e., gold standard) within the nfvPPA group. Classification performance was assessed by training random forest and XGBoost machine-learning models including 5-fold cross-validation. ResultsAll individuals read the entire passage in less than five minutes. SSDM-L detected eight of the ten predefined dysfluency features at sufficient frequency to include them in subsequent analyses. All eight features distinguished PPA from controls (p<.006). Individuals with nfvPPA made more errors than the lvPPA group on every feature (all p<.023). Each feature showed a moderate positive correlation with a global MSE apraxia/dysarthria score (r=.31-.56; p<.001-.053). Together, the eight features were able to classify nfvPPA versus lvPPA at AUC=.806 (random forest) and AUC=.776 (XGBoost). DiscussionAI-based automated speech error analysis accurately distinguished nfvPPA and lvPPA variants using a brief reading task. This quick error-sensitive scalable AI system has the potential of providing a practical tool to aid diagnosis in aphasia and motor speech disorders.

Matching journals

The top 11 journals account for 50% of the predicted probability mass.

1
Scientific Reports
3102 papers in training set
Top 14%
6.8%
2
Brain Communications
147 papers in training set
Top 0.2%
6.8%
3
Journal of NeuroEngineering and Rehabilitation
28 papers in training set
Top 0.2%
6.4%
4
PLOS ONE
4510 papers in training set
Top 28%
6.3%
5
Journal of Speech, Language, and Hearing Research
10 papers in training set
Top 0.1%
4.9%
6
Journal of Alzheimer’s Disease
39 papers in training set
Top 0.2%
4.0%
7
NeuroImage: Clinical
132 papers in training set
Top 1%
3.6%
8
Annals of Neurology
57 papers in training set
Top 0.6%
3.6%
9
Neurology
44 papers in training set
Top 0.4%
3.6%
10
Frontiers in Neurology
91 papers in training set
Top 2%
3.6%
11
Annals of Clinical and Translational Neurology
29 papers in training set
Top 0.3%
2.9%
50% of probability mass above
12
Frontiers in Digital Health
20 papers in training set
Top 0.4%
2.6%
13
Brain
154 papers in training set
Top 2%
2.6%
14
Cortex
102 papers in training set
Top 0.2%
1.9%
15
Computers in Biology and Medicine
120 papers in training set
Top 2%
1.9%
16
Journal of Alzheimer's Disease
43 papers in training set
Top 0.7%
1.8%
17
Frontiers in Neuroscience
223 papers in training set
Top 4%
1.7%
18
Alzheimer's Research & Therapy
52 papers in training set
Top 1%
1.7%
19
BMC Neurology
12 papers in training set
Top 0.5%
1.3%
20
Journal of Neurology
26 papers in training set
Top 0.8%
1.3%
21
European Journal of Neurology
20 papers in training set
Top 0.4%
1.2%
22
Translational Psychiatry
219 papers in training set
Top 3%
1.0%
23
Clinical Neurophysiology
50 papers in training set
Top 0.5%
1.0%
24
Annals of the New York Academy of Sciences
12 papers in training set
Top 0.1%
0.8%
25
Epilepsia
49 papers in training set
Top 0.7%
0.8%
26
Neurorehabilitation and Neural Repair
17 papers in training set
Top 0.5%
0.7%
27
eBioMedicine
130 papers in training set
Top 4%
0.7%
28
Artificial Intelligence in Medicine
15 papers in training set
Top 0.7%
0.7%
29
PLOS Digital Health
91 papers in training set
Top 3%
0.7%
30
Neuroscience & Biobehavioral Reviews
43 papers in training set
Top 1%
0.7%