Back

Machine learning for the prediction of in-hospital mortality in patients with spontaneous intracerebral hemorrhage

Mao, B.; Zhang, R.; Pan, Y.; Zheng, R.; Shen, Y.; Lu, W.; Lu, Y.; Shanhu, X.; Wu, J.; Wang, M.; Wan, S.

2023-08-16 neurology
10.1101/2023.08.15.23294147 medRxiv
Show abstract

BackgroundsEarly and accurate identification of patients with spontaneous intracerebral hemorrhage(sICH) who are at high risk of in-hospital death can help intensive care unit (ICU) physicians make optimal clinical decisions. The aim of this study was to develop a machine learning(ML)-based tool to predict the risk of in-hospital death in patients with sICH in ICU. MethodsWe conducted a retrospective administrative database study using the MIMIC-IV and Zhejiang Hospital database. The outcome of the study was in-hospital mortality. To develop and validate the final model, we employed the LASSO regression to screen and select relevant variables. Five algorithms, namely Logistic Regression (LR), K-Nearest Neighbors (KNN), Adaptive Boosting (AdaBoost), Random Forest (RF), and eXtreme Gradient Boosting (XGBoost), were utilized. The selection of the best model was based on the area under the curve (AUC) in the validation cohort. Furthermore, we employ the SHapley Additive exPlanations (SHAP) methodology to elucidate the contributions of individual features to the model and analyze their impact on the models outputs. To facilitate accessibility, we also created a visual online calculation page for the model. ResultsIn the final cohort comprising 1596 patients from MIMIC-IV and Zhejiang Hospital, 367 individuals (23%) experienced in-hospital mortality during the inpatient follow-up period. After extracting 46 variables from the database, LASSO regression identified 14 predictor variables for further analysis. Among the five evaluated models, the XGBoost model demonstrated superior discriminative power in both the internal validation set (AUC = 0.907) and the external validation set (AUC = 0.787). Furthermore, through the SHAP technique, we identified the top 5 predictors in the feature importance rankings: Glasgow Coma Scale (GCS), Sequential Organ Failure Assessment (SOFA), anticoagulant medication, mannitol medication and oxygen saturation. ConclusionsAmong the five models, the XGBoost model exhibited superior performance in predicting mortality for patients with sICH in the ICU, indicating its potential significance in the development of early warning systems.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
BMC Medical Informatics and Decision Making
39 papers in training set
Top 0.1%
33.8%
2
PLOS ONE
4510 papers in training set
Top 18%
10.3%
3
Scientific Reports
3102 papers in training set
Top 22%
5.0%
4
Journal of Medical Internet Research
85 papers in training set
Top 1%
4.1%
50% of probability mass above
5
Neurocritical Care
11 papers in training set
Top 0.1%
3.8%
6
BioMed Research International
25 papers in training set
Top 0.7%
3.7%
7
Computers in Biology and Medicine
120 papers in training set
Top 1%
2.4%
8
Frontiers in Artificial Intelligence
18 papers in training set
Top 0.2%
1.9%
9
Biomedicines
66 papers in training set
Top 0.7%
1.8%
10
Medicine
30 papers in training set
Top 1%
1.7%
11
Journal of Stroke and Cerebrovascular Diseases
12 papers in training set
Top 0.3%
1.4%
12
Frontiers in Neurology
91 papers in training set
Top 3%
1.4%
13
Journal of Neurology
26 papers in training set
Top 0.8%
1.3%
14
JMIR Medical Informatics
17 papers in training set
Top 1%
1.3%
15
Clinical Chemistry
22 papers in training set
Top 0.6%
1.0%
16
International Journal of Medical Informatics
25 papers in training set
Top 1%
0.9%
17
PLOS Digital Health
91 papers in training set
Top 2%
0.8%
18
Journal of Clinical Medicine
91 papers in training set
Top 6%
0.8%
19
Annals of Translational Medicine
17 papers in training set
Top 1%
0.8%
20
Frontiers in Digital Health
20 papers in training set
Top 1%
0.8%
21
Stroke: Vascular and Interventional Neurology
13 papers in training set
Top 0.4%
0.8%
22
Archives of Clinical and Biomedical Research
28 papers in training set
Top 2%
0.8%
23
eLife
5422 papers in training set
Top 57%
0.8%
24
BMC Neurology
12 papers in training set
Top 0.9%
0.8%
25
Frontiers in Medicine
113 papers in training set
Top 7%
0.8%
26
International Journal of Epidemiology
74 papers in training set
Top 3%
0.7%
27
Medical Image Analysis
33 papers in training set
Top 1%
0.7%
28
BMJ Health & Care Informatics
13 papers in training set
Top 0.9%
0.7%
29
EClinicalMedicine
21 papers in training set
Top 1%
0.7%
30
Artificial Intelligence in Medicine
15 papers in training set
Top 0.8%
0.7%