Back

PRAM: Post-hoc Retrieval Augmentation for Parameter-Free Domain Adaptation of ICU Clinical Prediction Models

Jeong, I.; Lee, T.; Kim, B.; Park, J.-H.; Kim, Y.; Lee, H.

2026-04-05 health systems and quality improvement
10.64898/2026.04.03.26350132 medRxiv
Show abstract

Background Clinical prediction models degrade when deployed across hospitals, yet retraining requires technical expertise, labeled data, and regulatory re-approval. We investigated whether post-hoc retrieval augmentation of a frozen model's output, analogous to retrieval-augmented methods in natural language processing, can mitigate this degradation without any parameter modification. Methods We developed the Post-hoc Retrieval Augmentation Module (PRAM), which combines predictions from a frozen base model with outcome information retrieved from similar patients in a local patient bank. Five base models (logistic regression through CatBoost) and three retrieval strategies were evaluated on 116,010 ICU patients across three databases (MIMIC-IV, MIMIC-III, eICU-CRD) for acute kidney injury (AKI) and mortality prediction. A bank size deployment simulation modeled performance from zero to full local data accumulation, complemented by source bank cold start, stress tests, and calibration experiments. Model performance was evaluated using the area under the receiver operating characteristic curve (AUROC). Results Retrieval benefit was inversely associated with base model complexity ({rho} = -0.90 for AKI, -1.00 for mortality): simpler models benefited more, consistent with retrieval capturing residual signal unexploited by the base model. PRAM showed a statistically significant monotone dose-response between bank size and prediction performance across all six outcome-target combinations (Kendall {tau} trend test, q = 0.031 for all). At the pre-specified primary comparison (bank = 5,000), the improvement was confirmed for the two largest-shift settings (eICU-CRD AKI: {Delta}AUROC = +0.012, q < 0.001; eICU-CRD mortality: {Delta}AUROC = +0.026, q < 0.001). Pre-loading a source bank bridged the cold-start gap, providing an immediate performance gain equivalent to approximately 2,000 to 5,000 local patients. Conclusions PRAM provides a parameter-free adaptation mechanism that requires no model retraining, gradient computation, or regulatory re-evaluation at the deployment site. Effect sizes were modest and did not reach cross-model superiority, but the consistent dose-response pattern and the absence of retraining requirements establish retrieval-based adaptation as a viable approach for clinical model transportability. The retrieval mechanism additionally opens a pathway toward case-based interpretability, where predictions are accompanied by identifiable similar patients from the deploying institution.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
European Heart Journal - Digital Health
15 papers in training set
Top 0.1%
14.9%
2
JAMA Network Open
127 papers in training set
Top 0.1%
10.2%
3
Scientific Reports
3102 papers in training set
Top 6%
10.2%
4
PLOS Digital Health
91 papers in training set
Top 0.2%
8.5%
5
PLOS ONE
4510 papers in training set
Top 25%
6.9%
50% of probability mass above
6
BMC Medical Informatics and Decision Making
39 papers in training set
Top 0.4%
6.9%
7
npj Digital Medicine
97 papers in training set
Top 0.7%
6.9%
8
Computer Methods and Programs in Biomedicine
27 papers in training set
Top 0.1%
4.0%
9
Nature Medicine
117 papers in training set
Top 1%
2.1%
10
The Lancet Digital Health
25 papers in training set
Top 0.4%
1.7%
11
Physiological Measurement
12 papers in training set
Top 0.2%
1.7%
12
Medical Decision Making
10 papers in training set
Top 0.1%
1.5%
13
Journal of the American Medical Informatics Association
61 papers in training set
Top 1%
1.3%
14
Frontiers in Digital Health
20 papers in training set
Top 0.9%
1.2%
15
Journal of Biomedical Informatics
45 papers in training set
Top 1%
0.9%
16
JMIRx Med
31 papers in training set
Top 2%
0.8%
17
PLOS Computational Biology
1633 papers in training set
Top 23%
0.8%
18
JMIR Medical Informatics
17 papers in training set
Top 1%
0.8%
19
BMJ Health & Care Informatics
13 papers in training set
Top 1.0%
0.7%
20
iScience
1063 papers in training set
Top 34%
0.7%
21
Journal of Medical Internet Research
85 papers in training set
Top 5%
0.7%
22
Frontiers in Public Health
140 papers in training set
Top 9%
0.7%
23
Frontiers in Medicine
113 papers in training set
Top 8%
0.7%
24
Journal of Clinical Epidemiology
28 papers in training set
Top 0.7%
0.5%
25
Canadian Medical Association Journal
15 papers in training set
Top 0.5%
0.5%
26
Communications Biology
886 papers in training set
Top 32%
0.5%
27
International Journal of Medical Informatics
25 papers in training set
Top 2%
0.5%
28
Bioinformatics
1061 papers in training set
Top 11%
0.5%
29
Cell Reports Medicine
140 papers in training set
Top 10%
0.5%