Back

Socioeconomic and Behavioral Drivers of Geographic Disparities in U.S. Cardiovascular Mortality: A Machine Learning Analysis

Khan, L.; Khan, M.; Ahmad, M.; Lac, J.

2025-09-15 cardiovascular medicine
10.1101/2025.09.13.25334113 medRxiv
Show abstract

BackgroundSubstantial geographic disparities in cardiovascular disease (CVD) mortality persist across the United States. The extent to which "place" reflects underlying socioeconomic and behavioral risk factors remains insufficiently explained. This study applies machine learning to quantify the determinants of these disparities. MethodsA cross-sectional analysis linked county-level 2019-2020 age-adjusted CVD mortality rates from the CDC with health determinant metrics from the 2023 County Health Rankings dataset. The analytic sample included [N counties] with complete data. A Random Forest regressor modeled mortality outcomes, incorporating socioeconomic, healthcare access, and behavioral predictors. Model interpretation used SHAP to assess feature-level contributions. ResultsThe model explained [R2 value] of variance in CVD mortality. Socioeconomic factors, particularly median household income and poverty rates, were the most influential predictors, followed by county-level smoking prevalence. Geographic identifiers alone had limited explanatory value after accounting for socioeconomic and behavioral metrics. ConclusionsGeographic disparities in CVD mortality are explained by underlying socioeconomic disadvantage and community health behaviors. Effective reduction of disparities requires public health interventions addressing poverty, education, and behavioral risk factors beyond clinical care. What Is New?Explanatory vs. Predictive Modeling: Previous research has largely focused on identifying geographic disparities in cardiovascular disease (CVD) mortality. This study goes further by not only predicting mortality but also explaining why disparities exist, quantifying the relative importance of socioeconomic, behavioral, and healthcare access determinants. Advanced InterpretationWe apply SHAP (SHapley Additive exPlanations), an advanced interpretability framework in machine learning, to measure precisely the effect of each county-level characteristic on mortality, uncovering complex patterns and interactions. Integrated Data ApproachBy combining recent granular datasets on health outcomes, socioeconomic context, and behaviors, this study produces a multi-domain explanatory model of CVD mortality drivers at the national scale. Clinical ImplicationsFindings show that clinical interventions alone are insufficient to eliminate disparities in CVD mortality, since the most powerful predictors are upstream social determinants of health. This evidence supports the need for clinicians and health systems to partner in policies that address economic stability, educational access, and environments conducive to healthier behaviors. Strategic targeting of resources toward communities with high poverty and low educational attainment may yield more effective and equitable reductions in the national CVD burden compared to approaches focused only on clinical care.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Journal of the American Heart Association
119 papers in training set
Top 0.2%
18.3%
2
Epidemiology
26 papers in training set
Top 0.1%
12.2%
3
American Journal of Preventive Medicine
11 papers in training set
Top 0.1%
12.2%
4
PLOS ONE
4510 papers in training set
Top 35%
4.1%
5
The American Journal of Cardiology
15 papers in training set
Top 0.6%
3.9%
50% of probability mass above
6
The Lancet Digital Health
25 papers in training set
Top 0.1%
3.9%
7
Scientific Reports
3102 papers in training set
Top 38%
3.5%
8
PLOS Medicine
98 papers in training set
Top 1%
3.0%
9
BMJ Open
554 papers in training set
Top 7%
2.7%
10
BMJ Health & Care Informatics
13 papers in training set
Top 0.2%
2.6%
11
JAMA Network Open
127 papers in training set
Top 2%
2.0%
12
International Journal of Epidemiology
74 papers in training set
Top 1%
1.9%
13
Journal of Racial and Ethnic Health Disparities
11 papers in training set
Top 0.2%
1.7%
14
BMC Medical Informatics and Decision Making
39 papers in training set
Top 2%
1.7%
15
Circulation
66 papers in training set
Top 2%
1.6%
16
BMC Medicine
163 papers in training set
Top 4%
1.5%
17
Journal of the American Medical Informatics Association
61 papers in training set
Top 1%
1.3%
18
Circulation: Genomic and Precision Medicine
42 papers in training set
Top 0.9%
1.3%
19
Canadian Medical Association Journal
15 papers in training set
Top 0.2%
1.2%
20
PNAS Nexus
147 papers in training set
Top 0.9%
0.9%
21
International Journal of Environmental Research and Public Health
124 papers in training set
Top 6%
0.9%
22
Heart
10 papers in training set
Top 0.8%
0.9%
23
SSM - Population Health
17 papers in training set
Top 0.4%
0.8%
24
Cureus
67 papers in training set
Top 5%
0.8%
25
BMC Medical Research Methodology
43 papers in training set
Top 1%
0.8%
26
Frontiers in Cardiovascular Medicine
49 papers in training set
Top 3%
0.7%
27
JMIR Public Health and Surveillance
45 papers in training set
Top 4%
0.7%
28
BMC Public Health
147 papers in training set
Top 6%
0.7%
29
Journal of Biomedical Informatics
45 papers in training set
Top 2%
0.7%
30
Nature Communications
4913 papers in training set
Top 64%
0.7%