Back

Condition-Specific Readmission Risk Stratification in a Predominantly Black Statewide Cohort Using Machine Learning: Development of Subtype-Specific Models for Heart Failure, Acute Myocardial Infarction, Atrial Fibrillation/Flutter, and Hypertensive Heart Disease

EL Moudden, I.; Bittner, M.; Dodani, S.

2026-03-09 cardiovascular medicine
10.64898/2026.03.08.26347901 medRxiv
Show abstract

BackgroundCardiovascular disease (CVD) readmissions impose substantial clinical and economic burden. Machine learning (ML) may improve risk stratification, yet most predictive models aggregate CVD subtypes into a single outcome and underrepresent Black populations. MethodsUsing Virginia Health Information database records (2010 to 2020), we analyzed 157,791 discharge records from 123,272 unique patients (96.6% Black) to develop condition-specific 30-day readmission models for heart failure (HF; n=91,752), acute myocardial infarction (AMI; n=34,497), atrial fibrillation/flutter (AF/AFL; n=18,424), and hypertensive heart disease (HHD; n=13,118). Four algorithms (XGBoost, LightGBM, Random Forest, Elastic Net) plus a Super Learner ensemble were trained on patient-grouped 70/30 splits with and without Synthetic Minority Oversampling Technique balancing. Models incorporated validated clinical indices (LACE, Charlson, Elixhauser) and administrative social determinants of health proxies. ResultsThe overall 30-day readmission rate was 18.9%. Best area under the receiver operating characteristic curve (AUC) values by condition were HF 0.708 (95% CI, 0.701 to 0.716), AMI 0.706 (95% CI, 0.691 to 0.721), AF/AFL 0.732 (95% CI, 0.715 to 0.750), and HHD 0.758 (95% CI, 0.735 to 0.777). XGBoost was the top-performing algorithm for three of four subtypes. The LACE Index, Charlson Comorbidity Index, and insurance type were consistently the strongest predictors. Algorithm-native, aggregated, and SHAP-based importance measures converged on these key features. ConclusionsIn this largest-to-date, predominantly Black statewide cohort, condition-specific ML models achieved moderate-to-high discrimination for HF, AMI, AF/AFL, and HHD. Key clinical indices and administrative social determinants proxies emerged as dominant predictors, highlighting modifiable targets and high-risk subgroups. These findings support the development of precision, equity-informed readmission interventions and provide a scalable framework for deploying ML-driven decision support in safety-net and minority-serving healthcare systems. WHAT IS KNOWN* Machine learning models for cardiovascular readmission prediction have largely aggregated disease subtypes and underrepresented Black populations. * Most existing studies lack head-to-head algorithm comparisons within racially concentrated cohorts and omit social determinants of health proxies. WHAT THE STUDY ADDS* Condition-specific models for four cardiovascular subtypes achieved moderate-to-high discrimination (AUC 0.690 to 0.706) in the largest machine learning-based analysis of a predominantly Black statewide cohort. * Validated clinical indices (LACE, Charlson) and insurance type consistently emerged as dominant predictors, identifying modifiable targets for equity-informed intervention. * The scalable, administrative-data-only framework supports deployment of subtype-specific readmission decision support in safety-net and minority-serving health systems.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Circulation
66 papers in training set
Top 0.2%
14.2%
2
European Heart Journal - Digital Health
15 papers in training set
Top 0.1%
9.0%
3
The Lancet Digital Health
25 papers in training set
Top 0.1%
6.7%
4
Journal of the American Heart Association
119 papers in training set
Top 1%
6.3%
5
The American Journal of Cardiology
15 papers in training set
Top 0.4%
6.2%
6
Circulation: Genomic and Precision Medicine
42 papers in training set
Top 0.4%
4.8%
7
PLOS ONE
4510 papers in training set
Top 32%
4.8%
50% of probability mass above
8
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.8%
3.5%
9
npj Digital Medicine
97 papers in training set
Top 1%
2.8%
10
Scientific Reports
3102 papers in training set
Top 47%
2.4%
11
Heart
10 papers in training set
Top 0.4%
2.3%
12
American Journal of Preventive Medicine
11 papers in training set
Top 0.2%
2.1%
13
BMJ
49 papers in training set
Top 0.4%
2.1%
14
BMC Medicine
163 papers in training set
Top 3%
1.9%
15
Epidemiology
26 papers in training set
Top 0.2%
1.8%
16
JAMA Network Open
127 papers in training set
Top 2%
1.7%
17
BMJ Health & Care Informatics
13 papers in training set
Top 0.4%
1.7%
18
PLOS Medicine
98 papers in training set
Top 3%
1.7%
19
Circulation: Heart Failure
14 papers in training set
Top 0.3%
1.6%
20
Journal of the American College of Cardiology
12 papers in training set
Top 0.4%
1.5%
21
European Journal of Preventive Cardiology
13 papers in training set
Top 0.7%
1.2%
22
JMIR Medical Informatics
17 papers in training set
Top 1%
1.2%
23
Nature Communications
4913 papers in training set
Top 57%
1.1%
24
BMC Medical Informatics and Decision Making
39 papers in training set
Top 2%
0.9%
25
Canadian Medical Association Journal
15 papers in training set
Top 0.2%
0.9%
26
Frontiers in Artificial Intelligence
18 papers in training set
Top 0.6%
0.9%
27
International Journal of Epidemiology
74 papers in training set
Top 2%
0.9%
28
Frontiers in Cardiovascular Medicine
49 papers in training set
Top 2%
0.8%
29
BMJ Open
554 papers in training set
Top 13%
0.7%
30
Journal of the American Geriatrics Society
12 papers in training set
Top 0.2%
0.7%