Back

Statistical analysis for the development of a deep learning model for classification of images with TDP-43 pathology

Munoz, A.; Oliveira, V.; Vallejo, M.

2024-02-14 neurology
10.1101/2024.02.12.24300689 medRxiv
Show abstract

Diagnosing Amyotrophic Lateral Sclerosis (ALS) remains challenging due to its inherent heterogeneity. Cytoplasmic aggregation of TDP-43, observed in approximately 95% of ALS cases, has emerged as a key pathological hallmark. In this observational study, we investigated the feasibility of training deep learning models to classify TDP-43 pro-teinopathic samples versus healthy controls, with a particular focus on understanding how dataset limitations affect model performance. The dataset comprised super-resolution immunofluorescence images in which cytoplasmic and nuclear TDP-43 deposits were quantified using red and pink pixel counts. We formulated three classification tasks: TDP-43 pathology (binary), TDP-43 pathology grades (multiclass), and ALS diagnosis (binary). Initial deep learning experiments yielded inconclusive results, prompting dataset curation and the removal of problematic samples. Subsequent statistical analyses using t-tests, ANOVA, and hierarchical clustering revealed significant differences between healthy and pathological samples in terms of pixel distributions, total protein levels, and TDP-43 compart-mentalisation. These findings suggest that classification based on TDP-43 proteinopathy provides a more reliable framework for deep learning compared to ALS diagnosis, underscoring the importance of data quality and task strati-fication in model performance.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Neuropathology and Applied Neurobiology
14 papers in training set
Top 0.1%
28.8%
2
Scientific Reports
3102 papers in training set
Top 5%
10.5%
3
Brain Communications
147 papers in training set
Top 0.1%
9.5%
4
Brain
154 papers in training set
Top 1%
5.0%
50% of probability mass above
5
Journal of Neurology
26 papers in training set
Top 0.2%
3.7%
6
npj Parkinson's Disease
89 papers in training set
Top 0.6%
2.7%
7
NeuroImage: Clinical
132 papers in training set
Top 2%
1.9%
8
Frontiers in Neurology
91 papers in training set
Top 3%
1.8%
9
Annals of Clinical and Translational Neurology
29 papers in training set
Top 0.5%
1.8%
10
Neurobiology of Disease
134 papers in training set
Top 2%
1.7%
11
Journal of Neurology, Neurosurgery & Psychiatry
29 papers in training set
Top 0.6%
1.7%
12
Clinical Chemistry
22 papers in training set
Top 0.5%
1.4%
13
Frontiers in Neuroscience
223 papers in training set
Top 5%
1.4%
14
Frontiers in Aging Neuroscience
67 papers in training set
Top 2%
1.3%
15
Multiple Sclerosis Journal
18 papers in training set
Top 0.2%
1.3%
16
Computers in Biology and Medicine
120 papers in training set
Top 3%
1.2%
17
Annals of Neurology
57 papers in training set
Top 2%
0.9%
18
Nature Communications
4913 papers in training set
Top 60%
0.8%
19
eBioMedicine
130 papers in training set
Top 4%
0.8%
20
Communications Biology
886 papers in training set
Top 22%
0.8%
21
NeuroImage
813 papers in training set
Top 6%
0.8%
22
PLOS Digital Health
91 papers in training set
Top 3%
0.7%
23
Clinical Immunology
21 papers in training set
Top 0.6%
0.7%
24
Biology Methods and Protocols
53 papers in training set
Top 3%
0.7%
25
PLOS ONE
4510 papers in training set
Top 71%
0.7%
26
Human Genetics and Genomics Advances
70 papers in training set
Top 0.9%
0.7%
27
The Journal of Pathology
22 papers in training set
Top 0.6%
0.7%
28
Journal of Pathology Informatics
13 papers in training set
Top 0.4%
0.7%
29
Nature Computational Science
50 papers in training set
Top 2%
0.7%
30
Alzheimer's & Dementia
143 papers in training set
Top 3%
0.5%