Back

Acute-Phase Machine Learning Prediction of 12-Month Aphasia and Discourse Recovery

Marte, M. J.; Chaves, M.; Kelly, L.; Diaz-Carr, I.; Neal, V.; Faria, A. V.; Stockbridge, M. D.; Hillis, A. E.

2026-05-17 neurology
10.64898/2026.05.13.26353123 medRxiv
Show abstract

Approximately 30-40% of stroke patients retain aphasia at 12 months. Early forecasting may guide rehabilitation and prognostic enrichment of clinical trials, yet machine learning (ML) prediction of language recovery has typically relied on chronic-phase data unavailable at the acute decision point. Whether acute features predict 12-month outcomes, and whether global severity and connected-speech recovery share substrates in an ML framework, is untested. We studied 73 patients with acute left-hemisphere ischemic stroke and aphasia (mean 2.8 days post-onset). Two 12-month outcomes were defined: aphasia resolution (Western Aphasia Battery-Revised Aphasia Quotient [WAB-AQ] [≥]93.8) and discourse normalization (Modern Cookie Theft content units [≥]22.1; N=61). Four ML algorithms were trained on four hierarchical feature sets (clinical, volumetric, anatomical, network-disconnection) using nested cross-validation and SHapley Additive exPlanations (SHAP) stability analysis. Acute WAB-AQ dominated (mean |SHAP| = 13.60, ~20x the next feature). For aphasia resolution, random forest achieved F1 = 0.874 (95% CI, 0.800-0.941), Pearson r = 0.827, mean absolute error (MAE) = 7.26 WAB-AQ points; clinical features alone achieved F1 = 0.851. For discourse, support vector regression achieved F1 = 0.725 (95% CI 0.593-0.831), r = 0.617, MAE = 8.96 content units. Three predictors were shared (acute WAB-AQ, lesion volume, left pars triangularis); ventral-stream tracts were linked to aphasia resolution, whereas interhemispheric and prefrontal connectivity were linked to discourse. Both models overpredicted severe chronic outcomes. Acute-phase ML forecasts 12-month aphasia resolution accurately and discourse more modestly. Clinical features carry most predictive variance; acute imaging reveals shared and outcome-specific substrates mapping onto dual-stream architecture, supporting early stratification for rehabilitation and prognostic trial enrichment.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Brain
154 papers in training set
Top 0.1%
22.7%
2
Nature Communications
4913 papers in training set
Top 14%
12.4%
3
Nature Medicine
117 papers in training set
Top 0.1%
10.5%
4
Brain Communications
147 papers in training set
Top 0.4%
4.9%
50% of probability mass above
5
NeuroImage: Clinical
132 papers in training set
Top 0.9%
4.9%
6
Nature
575 papers in training set
Top 6%
4.0%
7
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 20%
3.6%
8
Nature Neuroscience
216 papers in training set
Top 3%
3.1%
9
eLife
5422 papers in training set
Top 35%
2.1%
10
Nature Genetics
240 papers in training set
Top 3%
2.1%
11
Med
38 papers in training set
Top 0.2%
1.9%
12
Nature Biomedical Engineering
42 papers in training set
Top 0.8%
1.8%
13
Nature Cancer
35 papers in training set
Top 0.9%
1.5%
14
Neuron
282 papers in training set
Top 6%
1.5%
15
Nature Computational Science
50 papers in training set
Top 0.9%
1.3%
16
npj Digital Medicine
97 papers in training set
Top 3%
1.3%
17
Scientific Reports
3102 papers in training set
Top 65%
1.2%
18
Annals of Neurology
57 papers in training set
Top 2%
1.2%
19
Alzheimer's & Dementia
143 papers in training set
Top 2%
0.9%
20
Science Translational Medicine
111 papers in training set
Top 5%
0.8%
21
EMBO Molecular Medicine
85 papers in training set
Top 4%
0.8%
22
Neurology
44 papers in training set
Top 2%
0.8%
23
Circulation
66 papers in training set
Top 3%
0.7%
24
Frontiers in Neurology
91 papers in training set
Top 6%
0.6%
25
Cell Stem Cell
57 papers in training set
Top 3%
0.6%
26
Stroke
35 papers in training set
Top 0.9%
0.5%
27
Cell Metabolism
49 papers in training set
Top 3%
0.5%