Back

Improving the Prediction of Unplanned 30-day Cancer Readmissions Using Social Determinants of Health: A Geocoding-based Approach

Bindhu, S.; Wu, T.-C.; Shih, H.; Chintalapalli, H.; Liu, H.; Wells, A.; Morrison, C. F.; Hsu, W.-W.; Wu, D. T.

2025-09-02 health informatics
10.1101/2025.08.31.25334806 medRxiv
Show abstract

Unplanned cancer readmissions present a significant burden on patients and hospitals. Current predictive models often overlook socioeconomic factors such as social determinants of health (SDoH), which have the potential to improve prediction performance, as measured by the Area Under the Receiver Operating Characteristic (AUROC) and the Precision-Recall Curve (AUPRC). To investigate this, the present study developed predictive models using cancer readmission data from a large health system in Hamilton County, OH. The models incorporated geocoding-based SDoH along with clustering techniques and compared machine learning (ML) and deep learning (DL) algorithms. Overall, models, regardless of algorithm type, not using SDoH variables had higher AUROC and AUPRCs. The best-performing ML and DL models are comparable (AUROC = 0.7605 for ML; AUROC = 0.7585 for DL). However, when top-performing models were evaluated across certain organ and system cancers, using SDoH and clustering techniques significantly improved model performance. This was most notable for cancers of the skin, subcutaneous tissue, and breast with improvements of 8.20% in AUROC and 11.04% in AUPRC. For all cancer patient cases, utilizing individualized SDoH information extracted from clinical notes was recommended for future studies.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
BMC Medical Informatics and Decision Making
39 papers in training set
Top 0.1%
18.0%
2
JMIR Medical Informatics
17 papers in training set
Top 0.1%
17.6%
3
Journal of Biomedical Informatics
45 papers in training set
Top 0.2%
6.6%
4
JAMIA Open
37 papers in training set
Top 0.2%
6.6%
5
PLOS ONE
4510 papers in training set
Top 29%
6.1%
50% of probability mass above
6
International Journal of Medical Informatics
25 papers in training set
Top 0.3%
4.7%
7
Journal of Medical Internet Research
85 papers in training set
Top 1%
4.2%
8
BMJ Health & Care Informatics
13 papers in training set
Top 0.2%
3.5%
9
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.8%
3.1%
10
Scientific Reports
3102 papers in training set
Top 42%
3.0%
11
BMC Medical Research Methodology
43 papers in training set
Top 0.4%
2.6%
12
JMIR Public Health and Surveillance
45 papers in training set
Top 1%
2.5%
13
BMJ Open
554 papers in training set
Top 8%
1.8%
14
PLOS Digital Health
91 papers in training set
Top 2%
1.6%
15
Frontiers in Public Health
140 papers in training set
Top 5%
1.6%
16
npj Digital Medicine
97 papers in training set
Top 2%
1.6%
17
Cancer Medicine
24 papers in training set
Top 1%
0.9%
18
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.7%
0.9%
19
BMC Infectious Diseases
118 papers in training set
Top 5%
0.8%
20
Artificial Intelligence in Medicine
15 papers in training set
Top 0.7%
0.8%
21
The Lancet Digital Health
25 papers in training set
Top 1%
0.8%
22
PeerJ
261 papers in training set
Top 14%
0.8%
23
Computer Methods and Programs in Biomedicine
27 papers in training set
Top 1%
0.7%
24
JAMA Network Open
127 papers in training set
Top 5%
0.7%
25
Epidemiology and Infection
84 papers in training set
Top 4%
0.6%
26
JMIRx Med
31 papers in training set
Top 2%
0.6%