Back

Strengthening Deep-learning Models for Intracranial Hemorrhage Detection: Strongly Annotated Computed Tomography Images and Model Ensembles

Kang, D.-W.; Park, G.-H.; Ryu, W.-S.; Schellingerhout, D.; Kim, M.; Kim, Y. S.; Park, C.-Y.; Lee, K.-J.; Han, M.-K.; Jeong, H.-G.; Kim, D.-E.

2023-08-25 neurology
10.1101/2023.08.24.23293394 medRxiv
Show abstract

Multiple attempts at intracranial hemorrhage (ICH) detection using deep-learning techniques have been made and plagued with clinical failures. Most studies for ICH detection have insufficient data or weak annotations. We sought to determine whether a deep-learning algorithm for ICH detection trained on a strongly annotated dataset outperforms that trained on a weakly annotated dataset, and whether a weighted ensemble model that integrates separate models trained using datasets with different ICH subtypes is more accurate. We used publicly available brain CT scans from the Radiological Society of North America (27,861 CT scans, 3,528 ICHs) and AI-Hub (53,045 CT scans, 7,013 ICHs) for training datasets. For external testing, 600 CT scans (327 with ICH) from Dongguk University Medical Center and 386 CT scans (160 with ICH) from Qure.ai were used. DenseNet121, InceptionResNetV2, MobileNetV2, and VGG19 were trained on strongly and weakly annotated datasets and compared. We then developed a weighted ensemble model combining separate models trained on all ICH, subdural hemorrhage (SDH), subarachnoid hemorrhage (SAH), and small-lesion ICH cases. The final weighted ensemble model was compared to four well-known deep-learning models. Six neurologists reviewed difficult ICH cases after external testing. InceptionResNetV2, MobileNetV2, and VGG19 models outperformed when trained on strongly annotated datasets. A weighted ensemble model combining models trained on SDH, SAH, and small-lesion ICH had a higher AUC than a model only trained on all ICH cases. This model outperformed four well-known deep-learning models in terms of sensitivity, specificity, and AUC. Strongly annotated data are superior to weakly annotated data for training deep-learning algorithms. Since no model can capture all aspects of a complex task well, we developed a weighted ensemble model for ICH detection after training with large-scale strongly annotated CT scans. We also showed that a better understanding and management of cases challenging for AI and human is required to facilitate clinical use of ICH detection algorithms. Key PointsQuestion Can a weighted ensemble method and strongly annotated training datasets develop a deep-learning model with high accuracy to detect intracranial hemorrhage? Findings A deep-learning algorithm for detecting ICH trained with a strongly annotated dataset outperformed models trained with a weakly annotated dataset. After ensembling separate models that were trained with only SDH, SAH, and small-lesion ICH, a weighted ensemble model had a higher AUC. Meaning This study suggests that to enhance the performance of deep-learning models, researchers should consider the distinct imaging characteristics of each hemorrhage subtype and use strongly annotated training datasets.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Frontiers in Neurology
91 papers in training set
Top 0.1%
23.2%
2
Journal of the Neurological Sciences
17 papers in training set
Top 0.1%
6.5%
3
European Radiology
14 papers in training set
Top 0.1%
5.0%
4
PLOS ONE
4510 papers in training set
Top 33%
4.4%
5
BMJ Open
554 papers in training set
Top 6%
3.4%
6
Neurocritical Care
11 papers in training set
Top 0.1%
3.4%
7
Annals of Clinical and Translational Neurology
29 papers in training set
Top 0.3%
2.7%
8
Brain Communications
147 papers in training set
Top 1%
2.1%
50% of probability mass above
9
Stroke: Vascular and Interventional Neurology
13 papers in training set
Top 0.2%
2.1%
10
Neurology
44 papers in training set
Top 0.7%
1.9%
11
Alzheimer's & Dementia
143 papers in training set
Top 2%
1.9%
12
Neuroinformatics
40 papers in training set
Top 0.5%
1.7%
13
Frontiers in Neuroscience
223 papers in training set
Top 4%
1.7%
14
Scientific Reports
3102 papers in training set
Top 56%
1.7%
15
npj Digital Medicine
97 papers in training set
Top 2%
1.7%
16
BMC Neurology
12 papers in training set
Top 0.4%
1.7%
17
NeuroImage: Clinical
132 papers in training set
Top 2%
1.7%
18
EClinicalMedicine
21 papers in training set
Top 0.4%
1.4%
19
Clinical Chemistry
22 papers in training set
Top 0.5%
1.3%
20
BMC Medical Informatics and Decision Making
39 papers in training set
Top 2%
1.3%
21
Journal of Neurology
26 papers in training set
Top 0.9%
1.1%
22
Stroke
35 papers in training set
Top 0.6%
1.0%
23
PLOS Digital Health
91 papers in training set
Top 2%
1.0%
24
Journal of Neurology, Neurosurgery & Psychiatry
29 papers in training set
Top 1%
0.8%
25
BMJ Health & Care Informatics
13 papers in training set
Top 0.8%
0.8%
26
Neuro-Oncology Advances
24 papers in training set
Top 0.4%
0.8%
27
Journal of the American Heart Association
119 papers in training set
Top 4%
0.8%
28
Journal of Clinical Medicine
91 papers in training set
Top 6%
0.8%
29
Frontiers in Integrative Neuroscience
12 papers in training set
Top 0.3%
0.8%
30
Informatics in Medicine Unlocked
21 papers in training set
Top 1%
0.8%