Back

Conformal Prediction and Ensemble Learning for Uncertainty-Aware ICU Mortality Stratification

Kalita, A.; Chattopadhyay, A.; Bhattacharjee, M.; Das, K.

2026-06-02 health informatics
10.64898/2026.05.29.26354474 medRxiv
Show abstract

Background. Conventional ICU severity scores - SOFA, qSOFA, and APACHE-II - use additive integer weightings that cannot capture non-linear organ failure interactions; prospective validations consistently report AUC near 0.73. None quantifies prediction uncertainty, evaluates demographic equity, or acknowledges that their key biomarkers (albumin, creatinine, BUN, lactate, GCS) are also primary confounders of emerging Alzheimer's disease (AD) blood biomarkers p-tau217 and neurofilament light chain (NfL). Methods. Fourteen classifiers were trained on a SOFA-calibrated synthetic ICU cohort (N = 90,000; mortality 29.2%), including an FT-Transformer, XGBoost, and LightGBM tuned by Bayesian optimisation. Seven composite features were engineered from clinical first principles; the novel lactate/albumin ratio (rLA) mirrors the albumin-adjusted p-tau217 correction formula. Post-hoc analyses included nine-method aggregated permutation importance, Monte Carlo Dropout uncertainty decomposition (T = 50), distribution-free conformal prediction, a three-zone triage system, formal ablation, survival analysis, temporal deployment validation, and demographic fairness evaluation. Results. On a natural-distribution held-out cohort (n = 18,000; mortality 29.3%), XGBoost achieved AUC = 0.967 (95% CI 0.965-0.970), surpassing SOFA (AUC = 0.731) by +0.236 (DeLong z = 55.8, p < 0.001; NRI = +0.740). Selective prediction raised FT-Transformer AUC from 0.917 to 0.980 at 50% abstention. Removing neurodegeneration-proxy features reduced AUC by 9.51 percentage points. ML probability was the sole significant covariate in adjusted Cox regression (HR = 6.19, p < 0.001); SOFA, age, lactate, and albumin were non-significant. Temporal AUC range was 0.003 across four deployment windows; sex and age AUC gaps were 0.005 each. Conclusions. This framework delivers well-calibrated, uncertainty-aware ICU mortality prediction with formal coverage guarantees and demographic equity. Ablation-confirmed contributions of neurodegeneration-proxy features, with PDP inflection points aligning with established clinical thresholds, provide a hypothesis-generating quantitative link between routine ICU biomarkers and the AD neurodegeneration pathway warranting prospective validation.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
npj Digital Medicine
97 papers in training set
Top 0.3%
17.4%
2
Annals of Neurology
57 papers in training set
Top 0.1%
12.3%
3
Scientific Reports
3102 papers in training set
Top 24%
4.8%
4
Nature Communications
4913 papers in training set
Top 37%
4.1%
5
The Lancet Digital Health
25 papers in training set
Top 0.1%
3.9%
6
The Journal of Prevention of Alzheimer's Disease
10 papers in training set
Top 0.1%
3.6%
7
GeroScience
97 papers in training set
Top 0.6%
3.1%
8
eBioMedicine
130 papers in training set
Top 0.6%
2.6%
50% of probability mass above
9
Nature Aging
51 papers in training set
Top 0.7%
2.4%
10
Alzheimer's Research & Therapy
52 papers in training set
Top 0.9%
2.3%
11
Nature Medicine
117 papers in training set
Top 2%
2.1%
12
Age and Ageing
27 papers in training set
Top 0.3%
1.7%
13
Nature Biomedical Engineering
42 papers in training set
Top 0.9%
1.7%
14
PLOS ONE
4510 papers in training set
Top 55%
1.7%
15
Computers in Biology and Medicine
120 papers in training set
Top 3%
1.3%
16
Alzheimer's & Dementia
143 papers in training set
Top 2%
1.3%
17
Alzheimer's & Dementia: Diagnosis, Assessment & Disease Monitoring
38 papers in training set
Top 0.8%
1.3%
18
eClinicalMedicine
55 papers in training set
Top 1%
1.2%
19
Journal of Infection
71 papers in training set
Top 2%
1.1%
20
Communications Medicine
85 papers in training set
Top 0.6%
1.1%
21
Advanced Science
249 papers in training set
Top 17%
0.9%
22
BMC Medicine
163 papers in training set
Top 6%
0.9%
23
BMC Medical Research Methodology
43 papers in training set
Top 1%
0.9%
24
European Respiratory Journal
54 papers in training set
Top 2%
0.9%
25
Patterns
70 papers in training set
Top 2%
0.8%
26
NeuroImage: Clinical
132 papers in training set
Top 4%
0.8%
27
Brain
154 papers in training set
Top 4%
0.8%
28
Frontiers in Aging Neuroscience
67 papers in training set
Top 3%
0.8%
29
Science Translational Medicine
111 papers in training set
Top 6%
0.7%
30
Journal of the American Medical Informatics Association
61 papers in training set
Top 2%
0.7%