Back

Perioperative Mortality Prediction Using a Bayesian Ensemble with Prevalence-Adaptive Gating

Pandey, A. K.

2026-04-06 health informatics
10.64898/2026.04.03.26350114 medRxiv
Show abstract

Background: Perioperative mortality prediction in resource-limited surgical settings remains challenging due to class imbalance, missing data, and the heterogeneity of postoperative complications. Existing risk scores such as POSSUM depend on intraoperative variables and do not quantify prediction uncertainty. Methods: We developed a prevalence-adaptive Bayesian ensemble comprising three stochastic models: classifier Variational Autoencoder (VAE, AUC=0.95), a Flipout Last Layer network (AUC=0.84), and a Monte Carlo Dropout network (AUC=0.80), trained on 697 patients (39 deaths, prevalence 5.59%) with 67 preoperative and postoperative features. Class imbalance (16.9:1) was addressed through Variational Autoencoder augmentation: two class-conditional generative VAEs produced 619 synthetic survivor and 619 synthetic death records, yielding a balanced training corpus of 1,935 samples. VAE augmentation was selected over SMOTE and random oversampling after a comparative study (F1: random oversampling 0.61 vs VAE augmentation 0.77). Validation used a held-out set of 233 patients (13 deaths, 220 survivors). A six-stage prediction pipeline incorporated weighted base risk, a three-path prevalence-adaptive gate, Shannon entropy uncertainty quantification, and rank-transform calibration. Sensitivity analysis was conducted across all six empirically derived hyperparameters. A whole-cohort death audit evaluated all 52 deaths from the complete 930-patient dataset through the deployed clinical decision support system. Statistical analysis included Kruskal-Wallis testing of entropy across triage groups, Wilson score confidence intervals for performance metrics, and Spearman rank correlation for LIME-SHAP interpretability concordance. Results: On the validation cohort the ensemble achieved complete separation (sensitivity 100%, specificity 100%, Youden J=1.000; TP=13, FP=0, TN=220, FN=0). The whole-cohort death audit identified 36 of 52 deaths (sensitivity 69.2%, 95% CI 55.7%-80.1%; precision 100%, 95% CI 90.4%-100.0%; F1=0.818, bootstrap 95% CI 0.732-0.894). Shannon entropy differed significantly across triage levels (Kruskal-Wallis H(2)=24.212, p<0.001, {epsilon}2=0.453), confirming a monotone gradient SAFE < CRITICAL < GRAY ZONE. All six hyperparameters were invariant across their tested ranges (J=1.000 throughout; Supplementary Tables S1-S2). LIME and SHAP rankings showed statistically significant concordance (Spearman {rho}=0.440, p=0.024; Kendall T=0.357, p=0.011), with 4 of 6 principal mortality determinants shared across both methods. Conclusions: A prevalence-adaptive Bayesian ensemble with entropy-based uncertainty triage achieves zero false positive alerts and clinically meaningful audit sensitivity in perioperative mortality prediction. Complete hyperparameter invariance confirms that reported performance reflects structural properties of the calibration architecture. The 16 missed deaths represent feature-invisible cases beyond current observational feature capacity.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Scientific Reports
3102 papers in training set
Top 0.8%
18.8%
2
npj Digital Medicine
97 papers in training set
Top 0.4%
12.5%
3
Nature Communications
4913 papers in training set
Top 32%
4.9%
4
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.2%
4.4%
5
Critical Care Explorations
15 papers in training set
Top 0.1%
4.4%
6
JMIR Medical Informatics
17 papers in training set
Top 0.3%
4.0%
7
PLOS ONE
4510 papers in training set
Top 37%
3.7%
50% of probability mass above
8
Frontiers in Artificial Intelligence
18 papers in training set
Top 0.1%
3.6%
9
The Lancet Digital Health
25 papers in training set
Top 0.2%
2.8%
10
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.9%
2.6%
11
European Heart Journal - Digital Health
15 papers in training set
Top 0.3%
1.8%
12
BMC Medical Informatics and Decision Making
39 papers in training set
Top 1%
1.8%
13
Computer Methods and Programs in Biomedicine
27 papers in training set
Top 0.4%
1.7%
14
BMJ Health & Care Informatics
13 papers in training set
Top 0.5%
1.5%
15
Journal of Infection
71 papers in training set
Top 2%
1.3%
16
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 4%
1.3%
17
Journal of Medical Internet Research
85 papers in training set
Top 3%
1.3%
18
Science Advances
1098 papers in training set
Top 23%
1.2%
19
PLOS Computational Biology
1633 papers in training set
Top 22%
0.9%
20
BMC Medical Research Methodology
43 papers in training set
Top 1%
0.8%
21
Annals of Internal Medicine
27 papers in training set
Top 0.8%
0.8%
22
Biology Methods and Protocols
53 papers in training set
Top 2%
0.8%
23
British Journal of Anaesthesia
14 papers in training set
Top 0.7%
0.8%
24
Frontiers in Medicine
113 papers in training set
Top 6%
0.8%
25
Communications Biology
886 papers in training set
Top 23%
0.8%
26
Computers in Biology and Medicine
120 papers in training set
Top 4%
0.8%
27
Frontiers in Digital Health
20 papers in training set
Top 2%
0.7%
28
Experimental Neurology
57 papers in training set
Top 2%
0.7%
29
Journal of the American Heart Association
119 papers in training set
Top 4%
0.7%
30
JAMIA Open
37 papers in training set
Top 2%
0.7%