Back

Two-step deep-learning candidemia prediction model using two large time-sequence electronic health datasets

Yoshida, H.; Adelman, M. W.; Rasmy, L.; Ifiora, F.; Xie, Z.; Perez, M. A.; Guerra, F.; Yoshimura, H.; Jones, S. L.; Arias, C. A.; Zhi, D.; Nigo, M.

2026-03-04 infectious diseases
10.64898/2026.03.03.26347531 medRxiv
Show abstract

BackgroundCandidemia is a rare but life-threatening bloodstream infection that remains difficult to predict using conventional risk stratification approaches, highlighting the need for improved predictive strategies. As a result, empiric antifungal therapy is often delayed even in high-risk patients. MethodsWe developed a deep learning model (PyTorch_EHR) to predict 7-day candidemia risk by using electronic health record data from two large cohorts (Houston Methodist Hospital System [HMHS] and MIMIC-IV), including adult inpatients who underwent at least one blood culture. Model performance was compared with logistic regression (LR), LightGBM, and established intensive care unit candidemia scores. We further implemented a two-step prediction framework integrating candidemia and 30-day mortality risk models to inform empiric antifungal decision-making. ResultsAmong 213,404 and 107,507 patients in the HMHS and MIMIC-IV cohorts, candidemia occurred in fewer than 1% (851 [0.4%] and 634 [0.6%], respectively). PyTorch_EHR outperformed LR, LightGBM, and existing candidemia scores, particularly in terms of area under the precision-recall curve (AUPRC) in HMHS and MIMIC-IV. By integrating 30-day mortality risk, the two-step framework identified an additional 20 and 28 candidemia cases beyond the one-step model, increasing coverage to 61% (121/199) and 46% (68/147) in HMHS and MIMIC-IV, respectively. Many patients identified by the two-step framework had high mortality yet did not receive empiric antifungal therapy (61.1% HMHS; 82.6% MIMIC-IV). ConclusionA two-step deep-learning framework integrating candidemia and mortality risk may support early identification of high-risk patients and facilitate timely empiric antifungal therapy. Prospective studies are warranted to confirm the findings.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Clinical Infectious Diseases
231 papers in training set
Top 0.1%
19.5%
2
The Journal of Infectious Diseases
182 papers in training set
Top 0.3%
8.4%
3
Open Forum Infectious Diseases
134 papers in training set
Top 0.1%
6.4%
4
Journal of Infection
71 papers in training set
Top 0.2%
6.3%
5
Journal of Clinical Microbiology
120 papers in training set
Top 0.4%
4.9%
6
PLOS ONE
4510 papers in training set
Top 34%
4.3%
7
BMC Infectious Diseases
118 papers in training set
Top 1.0%
3.7%
50% of probability mass above
8
Journal of Medical Internet Research
85 papers in training set
Top 1%
3.6%
9
Nature Communications
4913 papers in training set
Top 43%
2.9%
10
Infection Control & Hospital Epidemiology
17 papers in training set
Top 0.1%
2.1%
11
Clinical Microbiology and Infection
60 papers in training set
Top 0.5%
1.7%
12
Scientific Reports
3102 papers in training set
Top 58%
1.7%
13
JAC-Antimicrobial Resistance
13 papers in training set
Top 0.2%
1.7%
14
American Journal of Infection Control
12 papers in training set
Top 0.1%
1.5%
15
The Lancet Microbe
43 papers in training set
Top 0.8%
1.3%
16
Journal of the American Medical Informatics Association
61 papers in training set
Top 1%
1.3%
17
Clinical Chemistry
22 papers in training set
Top 0.5%
1.3%
18
PLOS Computational Biology
1633 papers in training set
Top 21%
1.0%
19
npj Digital Medicine
97 papers in training set
Top 3%
0.9%
20
BMC Medicine
163 papers in training set
Top 6%
0.9%
21
Annals of Internal Medicine
27 papers in training set
Top 0.8%
0.8%
22
PLOS Medicine
98 papers in training set
Top 4%
0.8%
23
Eurosurveillance
80 papers in training set
Top 1%
0.8%
24
Genome Medicine
154 papers in training set
Top 8%
0.8%
25
Emerging Infectious Diseases
103 papers in training set
Top 3%
0.7%
26
Journal of Clinical Investigation
164 papers in training set
Top 7%
0.7%
27
Frontiers in Medicine
113 papers in training set
Top 8%
0.6%
28
PLOS Digital Health
91 papers in training set
Top 3%
0.6%
29
Tropical Medicine & International Health
15 papers in training set
Top 0.9%
0.6%
30
Communications Medicine
85 papers in training set
Top 2%
0.6%