Back

Diagnosis of Multiple Sclerosis Using Multimodal Deep Learning Integrating Lesion and Normal-Appearing White Matter: A Retrospective Study with International Multicentre External Validation

Ma, J.; Stepanov, V.; Rui, W.; Chen, H.-C.; Lis, M.; Stanek, A.; Puto, T.; Lan, M.; Chen, J.; Liu, T.; Patel, R.; Breen, M.; Lee, M.; Eikermann-Haerter, K.; Shepherd, T. M.; Novikov, D. S.; O'Neill, K. A.; Fieremans, E.; Shen, Y.

2026-03-10 radiology and imaging
10.64898/2026.03.04.26347460 medRxiv
Show abstract

BackgroundCurrent diagnostic criteria for multiple sclerosis (MS) rely on white matter lesions (WMLs), which are not specific and often occur in other disorders. Microstructural abnormalities in normal-appearing white matter (NAWM) may provide complementary information beyond focal lesions. However, the diagnostic use of NAWM in MS remains limited because a reproducible, diagnostically specific NAWM signature has not been established, and NAWM abnormalities detection typically requires quantitative MRI methods beyond routine clinical MRI protocols. MethodsIn this retrospective study, we proposed DeepMS, a deep learning model trained with both quantitative diffusion MRI (dMRI) and structural MRI (sMRI) to diagnose MS by integrating WML and NAWM features captured from routine MRI alone. Development utilized 8,450 scans from 7,703 patients (NYU Langone/ADNI). Evaluation included an internal test set (n=837) and two independent external cohorts: the Krakow cohort (Poland, n=293) and a public multi-site cohort curated from 15 datasets (n=1,756). We compared DeepMS against 2024 McDonald criteria biomarkers (Dissemination in Time [DIT], Dissemination in Space [DIS], Central Vein Sign [CVS], and Paramagnetic Rim Lesion [PRL]) in a multireader study (n=308). To validate the models use of NAWM, we performed lesion-masking experiments (n=550), comparing performance after removal of focal lesions. FindingsDeepMS achieved robust AUCs in the internal (0{middle dot}968 [95% CI 0{middle dot}946-0{middle dot}987]), Krakow (0{middle dot}940 [0{middle dot}898-0{middle dot}974]), and public external (0{middle dot}974 [0{middle dot}966-0{middle dot}982]) cohorts. In the multireader study, DeepMS outperformed established biomarkers: at matched sensitivity (92{middle dot}9%), DeepMS achieved higher specificity than DIS (89{middle dot}0% vs 78{middle dot}5%; p=0{middle dot}0061); at matched specificity (92{middle dot}8%), DeepMS achieved higher sensitivity than CVS (88{middle dot}2% vs 52{middle dot}0%; p<0{middle dot}0001). Furthermore, DeepMS retained diagnostic capability after WML masking (AUC 0{middle dot}959 to 0{middle dot}881) compared to the model trained with only sMRI (0{middle dot}895 to 0{middle dot}764). InterpretationOur findings suggest it is feasible for deep learning models to leverage NAWM-related information directly from routine sMRI. Integrating these features could help MS diagnosis in patients with ambiguous white matter abnormalities. FundingNational Institute of Neurological Disorders and Stroke, the National Institute of Biomedical Imaging and Bioengineering, and the Irma T. Hirschl Trust.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Annals of Neurology
57 papers in training set
Top 0.1%
22.1%
2
NeuroImage: Clinical
132 papers in training set
Top 0.3%
12.3%
3
eBioMedicine
130 papers in training set
Top 0.1%
6.3%
4
Frontiers in Neurology
91 papers in training set
Top 1%
4.8%
5
Science Translational Medicine
111 papers in training set
Top 0.6%
4.2%
6
Brain Communications
147 papers in training set
Top 0.6%
3.9%
50% of probability mass above
7
Multiple Sclerosis Journal
18 papers in training set
Top 0.1%
3.5%
8
Nature Communications
4913 papers in training set
Top 41%
3.5%
9
Neurology
44 papers in training set
Top 0.4%
3.5%
10
BMC Medicine
163 papers in training set
Top 2%
3.0%
11
Human Brain Mapping
295 papers in training set
Top 2%
3.0%
12
European Radiology
14 papers in training set
Top 0.3%
2.6%
13
Imaging Neuroscience
242 papers in training set
Top 2%
2.0%
14
Scientific Reports
3102 papers in training set
Top 51%
2.0%
15
Annals of Clinical and Translational Neurology
29 papers in training set
Top 0.4%
2.0%
16
Journal of Internal Medicine
12 papers in training set
Top 0.2%
1.7%
17
Journal of Magnetic Resonance Imaging
14 papers in training set
Top 0.5%
1.2%
18
European Journal of Neurology
20 papers in training set
Top 0.5%
0.9%
19
npj Digital Medicine
97 papers in training set
Top 3%
0.8%
20
Brain
154 papers in training set
Top 5%
0.7%
21
PLOS ONE
4510 papers in training set
Top 69%
0.7%
22
Diagnostics
48 papers in training set
Top 2%
0.7%
23
Movement Disorders
62 papers in training set
Top 1.0%
0.7%
24
Brain Connectivity
22 papers in training set
Top 0.2%
0.7%
25
Alzheimer's & Dementia
143 papers in training set
Top 3%
0.6%