Back

Outcome Prediction Models for Critically Ill Patients Using Small Routine Laboratory Datasets

Cao, X.; Hou, J.; Wei, X.; Wang, Q.

2026-04-27 emergency medicine
10.64898/2026.04.26.26351758 medRxiv
Show abstract

We present a suite of foundational, outcome prediction models for critically ill patients, developed using readily available, routine blood tests and advanced machine learning techniques. The input data of the models includes complete blood counts (CBCs), metabolic panels, and additional biomarkers that assess liver and kidney function, coagulation status, and cardiac injury. The output yields the predicted outcome at a given future horizon. For diagnoses, the length of the future horizon is set to zero, while it is set to a fixed time interval for prognoses. The training dataset in this study comprises clinical data from 332 ICU patients, augmented with 200 synthetic samples generated via a conditional diffusion model. Generative machine learning based data imputation and augmentation approaches yielded modest gains in predictive accuracy. However, substantial performance improvements were achieved through additional methods, including dimensionality and order reduction, SHAP based feature importance analysis, and a novel time series to image encoding strategy that enables the use of image based classifiers for temporal clinical data. Principal component analysis based order reduction produced measurable gains in outcome prediction, while the time series to image encoding proved particularly effective in mitigating small data limitations common in clinical research. Across all evaluation metrics, accuracy, precision, recall, F1 score, and AUROC, the prognostic models achieved performance exceeding 85\%, with some models attaining AUROC scores above 90%. We innovated a new model ensemble approach to optimize the predictive outcome. This ensemble modeling approach improves the overall prediction, pushing all assessment metrics over 90% . This work establishes a robust and interpretable AI enabled diagnostic and prognostic toolkit for outcome predictions in critically ill patients and demonstrates a scalable workflow for developing high performing models from sparse healthcare datasets. The proposed framework is readily deployable in ICU environments with routine blood testing capabilities and serves as a foundation for future integration into digital twin systems for critical care.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
PLOS Digital Health
91 papers in training set
Top 0.1%
23.6%
2
Scientific Reports
3102 papers in training set
Top 1%
15.4%
3
PLOS ONE
4510 papers in training set
Top 23%
7.5%
4
Artificial Intelligence in Medicine
15 papers in training set
Top 0.1%
5.1%
50% of probability mass above
5
npj Digital Medicine
97 papers in training set
Top 0.9%
5.1%
6
Cytometry Part A
30 papers in training set
Top 0.1%
3.8%
7
PLOS Computational Biology
1633 papers in training set
Top 11%
3.2%
8
BMC Medical Informatics and Decision Making
39 papers in training set
Top 1%
2.6%
9
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 0.7%
2.2%
10
Journal of Medical Internet Research
85 papers in training set
Top 2%
2.0%
11
Frontiers in Public Health
140 papers in training set
Top 4%
1.9%
12
Biology Methods and Protocols
53 papers in training set
Top 0.8%
1.8%
13
Frontiers in Medicine
113 papers in training set
Top 3%
1.8%
14
Patterns
70 papers in training set
Top 1%
1.4%
15
International Journal of Medical Informatics
25 papers in training set
Top 1.0%
1.4%
16
Advanced Science
249 papers in training set
Top 14%
1.3%
17
Heliyon
146 papers in training set
Top 3%
1.3%
18
iScience
1063 papers in training set
Top 23%
1.2%
19
Computers in Biology and Medicine
120 papers in training set
Top 4%
0.8%
20
Computational and Structural Biotechnology Journal
216 papers in training set
Top 9%
0.8%
21
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 46%
0.7%
22
Journal of Biomedical Informatics
45 papers in training set
Top 2%
0.7%
23
Frontiers in Physiology
93 papers in training set
Top 7%
0.7%
24
Computer Methods and Programs in Biomedicine
27 papers in training set
Top 1%
0.7%
25
Expert Systems with Applications
11 papers in training set
Top 0.6%
0.5%
26
IEEE Access
31 papers in training set
Top 1%
0.5%