Back

Interpretable Multimodal Machine Learning Model for Predicting Health Risks of Patients with Heart Failure

Chae, R.; Zhou, J.; Chou, O. H. I.; Yang, B.; Pu, H.; Tse, G.; Cheung, B. M. Y.; Zhu, T.; Car, J.; Lu, L.

2026-01-13 health informatics
10.64898/2026.01.12.26343905
Show abstract

Heart failure (HF) is one of the major causes of morbidity and mortality globally, necessitating accurate tools for health outcome prediction and risk stratification. In this study, we propose an interpretable multimodal machine learning framework integrating four clinical data modalities (i.e., demographics, medications, laboratory tests, and electrocardiograms [ECGs]) to predict 30-day all-cause mortality and hospital readmission in HF patients. Using clinical data from 2,868 HF patients across 43 local hospitals in Hong Kong, we trained and evaluated ten machine learning models for HF risk prediction, with the best performing model achieving an area under the receiver operating characteristic curve (AUC) of 0.881 for mortality and 0.709 for readmission. Notably, laboratory tests and ECG features dominate predictive power, and their combination alone yielded near-optimal results (AUC: 0.872), suggesting that these two modalities may be adequate for effective risk prediction in resource-constrained settings. The SHapley Additive exPlanations (SHAP) analysis identified serum albumin, high-sensitivity troponin I, lactate dehydrogenase, and QT interval dispersion as key predictors. Feature redundancy analysis further revealed strong correlations within laboratory tests and ECG features, suggesting opportunities for model simplification. To the best of our knowledge, this is the first study that comprehensively evaluates diverse configurations of four data modalities for HF risk prediction through ablation analysis, quantifying the marginal gains of each data modality and their combinations. Our findings demonstrate that interpretable multimodal machine learning model can enhance risk prediction in HF patients, supporting personalized management and scalable deployment across diverse healthcare settings.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Scientific Reports
based on 701 papers
Top 10%
12.1%
2
npj Digital Medicine
based on 85 papers
Top 2%
9.7%
3
Journal of Biomedical Informatics
based on 37 papers
Top 1.0%
7.2%
4
PLOS Digital Health
based on 88 papers
Top 2%
7.2%
5
PLOS ONE
based on 1737 papers
Top 60%
7.2%
6
BMC Medical Informatics and Decision Making
based on 36 papers
Top 2%
6.1%
7
JMIR Medical Informatics
based on 16 papers
Top 1%
4.5%
50% of probability mass above
8
Journal of the American Medical Informatics Association
based on 53 papers
Top 3%
4.3%
9
JAMIA Open
based on 35 papers
Top 3%
4.3%
10
Computers in Biology and Medicine
based on 39 papers
Top 1%
4.3%
11
Journal of Medical Internet Research
based on 81 papers
Top 6%
2.7%
12
International Journal of Medical Informatics
based on 25 papers
Top 2%
2.7%
13
European Heart Journal - Digital Health
based on 15 papers
Top 1%
2.2%
14
Nature Communications
based on 483 papers
Top 32%
1.5%
15
Frontiers in Artificial Intelligence
based on 11 papers
Top 1%
1.5%
16
Patterns
based on 15 papers
Top 2%
1.3%
17
BMJ Health & Care Informatics
based on 13 papers
Top 2%
1.3%
18
PLOS Computational Biology
based on 141 papers
Top 8%
1.1%
19
Nature Medicine
based on 88 papers
Top 13%
1.1%
20
The Lancet Digital Health
based on 25 papers
Top 5%
0.8%
21
IEEE Journal of Biomedical and Health Informatics
based on 14 papers
Top 3%
0.8%
22
BMC Medical Research Methodology
based on 41 papers
Top 7%
0.6%
23
JMIR Public Health and Surveillance
based on 45 papers
Top 14%
0.6%