Back

Multicenter analysis of COVID-19 hospitalizations and stacking machine learning algorithms for prediction of high-risk patients

Shaw, R.; Bassily, D.; Patel, L.; O'Connor, T.; Rafidi, R.; Formanek, P.

2023-06-22 health informatics
10.1101/2023.06.20.23291685 medRxiv
Show abstract

ObjectiveTo create and validate an ensemble of machine learning algorithms to accurately predict ICU admission or mortality upon initial presentation to the emergency department. MethodsThis is a retrospective cohort study of a multicenter hospital system in the United States. The electronic health record was queried from March 2020 to December 2021 for patients who presented to the emergency department who were subsequently COVID-positive. Associated patient demographics, vitals, and laboratory vitals were obtained. High-risk individuals were defined as those who required ICU admission or died; low-risk individuals did not meet those criteria. The dataset was split into a 3:1 training to testing dataset. A machine learning ensemble stack was built to predict ICU admission and mortality. ResultsOf the 3,142 hospital admissions with a COVID positive test, there were 1,128 (36%) individuals labeled as high-risk, and 2,014 (64%) as low-risk. We obtained 147 unique variables. CRP, LDH, procalcitonin, glucose, anion gap, creatinine, age, oxygen saturation, oxygen device, and obtainment of an ABG were chosen. Six machine learning models were then trained over model-specific hyperparameters, and then assessed on the testing dataset, generating an area under the receiver operator curve of 0.751, with a specificity of 95% in predicting high-risk individuals based on an initial emergency department assessment. ConclusionA novel machine learning model was generated to predict ICU admission and patient mortality from a multicenter hospital system and validated on unseen data.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
JMIR Medical Informatics
17 papers in training set
Top 0.1%
21.7%
2
Journal of Medical Internet Research
85 papers in training set
Top 0.2%
18.0%
3
International Journal of Medical Informatics
25 papers in training set
Top 0.1%
9.7%
4
BMC Medical Informatics and Decision Making
39 papers in training set
Top 0.3%
8.8%
50% of probability mass above
5
PLOS ONE
4510 papers in training set
Top 33%
4.7%
6
Scientific Reports
3102 papers in training set
Top 32%
3.8%
7
JAMIA Open
37 papers in training set
Top 0.5%
2.6%
8
Frontiers in Artificial Intelligence
18 papers in training set
Top 0.2%
2.0%
9
Informatics in Medicine Unlocked
21 papers in training set
Top 0.4%
1.8%
10
JMIR Public Health and Surveillance
45 papers in training set
Top 1%
1.8%
11
BMJ Open
554 papers in training set
Top 9%
1.7%
12
Frontiers in Medicine
113 papers in training set
Top 4%
1.4%
13
Journal of the American Medical Informatics Association
61 papers in training set
Top 1%
1.4%
14
Frontiers in Digital Health
20 papers in training set
Top 0.9%
1.3%
15
BMC Medical Research Methodology
43 papers in training set
Top 0.9%
1.2%
16
BMJ Health & Care Informatics
13 papers in training set
Top 0.6%
1.2%
17
Annals of Translational Medicine
17 papers in training set
Top 1%
0.8%
18
Biomedicines
66 papers in training set
Top 3%
0.8%
19
Clinical Chemistry
22 papers in training set
Top 0.9%
0.7%
20
Life
27 papers in training set
Top 0.5%
0.7%
21
npj Digital Medicine
97 papers in training set
Top 4%
0.7%
22
Critical Care
14 papers in training set
Top 0.7%
0.7%
23
Archives of Clinical and Biomedical Research
28 papers in training set
Top 3%
0.7%
24
Frontiers in Public Health
140 papers in training set
Top 9%
0.7%
25
BioMed Research International
25 papers in training set
Top 4%
0.6%