Back

Order-Based Bayesian Network Modeling of Early Detection and Post-Diagnosis Control for Cardiovascular Disease Risk in Type 2 Diabetes

Kathuria, Y.; Miller, K.; Selden, E. B.; Gallagher, W. J.; Capan, M.

2026-06-12 primary care research
10.64898/2026.06.10.26355419 medRxiv
Show abstract

Patients diagnosed with type 2 diabetes (T2D) are at increased risk of developing cardiovascular disease (CVD), the leading cause of morbidity and mortality in this population. Early detection and glycemic control within the first year after diagnosis reduce CVD risk. However, gaps remain in how to operationalize early detection of T2D using Electronic Health Record (EHR) data and quantify its relationship with subsequent CVD risk using longitudinal observations. We developed a probabilistic graph model to analyze the interdependencies between early detection of T2D, post-diagnosis glycemic control, and CVD occurrence. Using a temporally structured Bayesian Network (BN) learned from EHR data of 9,450 primary care patients between 2017 and 2023, we quantified probabilistic dependencies between demographics, diagnostic delay surrogates, glycemic control, and post-diagnosis CVD occurrence. Percentile based thresholds defined risk groups, where individuals with predicted probabilities in the bottom decile ([≤] 10th percentile) were classified as low risk, and those in the top decile ([≥] 90th percentile) as high risk. Results demonstrated heterogeneity in predicted risks across glycemic and cardiovascular outcomes. Predicted probability of developing CVD within the first year after T2D diagnosis ranged from a mean of 5.2% in the low-risk group to 28.9% in the high-risk group, while predicted probabilities of mean Hemoglobin A1c (HbA1c) [≥] 8% during the first year post-diagnosis ranged from 1.6% in low-risk to 55.1% in high-risk group. Patients with HbA1c at diagnosis [≥] 8% had higher predicted probabilities of first-year post-diagnosis mean HbA1c [≥] 8% (53.3% vs. 1.9%) and high HbA1c coefficient of variation (18.7% vs. 3.1%) compared with those with HbA1c [≤] 6.5%. Incorporating early clinical outcomes refined later risk predictions, with long-term CVD risk reaching 33.5% among high-risk individuals. The proposed model achieved predictive performance comparable to conventional machine learning approaches while providing interpretable relationships for risk stratification in primary care populations.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Communications Medicine
85 papers in training set
Top 0.1%
14.6%
2
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.2%
12.9%
3
JMIR Public Health and Surveillance
45 papers in training set
Top 0.1%
12.6%
4
Scientific Reports
3102 papers in training set
Top 11%
7.3%
5
Advanced Science
249 papers in training set
Top 2%
6.9%
50% of probability mass above
6
PLOS ONE
4510 papers in training set
Top 26%
6.4%
7
eLife
5422 papers in training set
Top 16%
4.9%
8
Journal of Personalized Medicine
28 papers in training set
Top 0.1%
2.4%
9
PLOS Computational Biology
1633 papers in training set
Top 13%
2.1%
10
JAMIA Open
37 papers in training set
Top 0.7%
1.9%
11
Journal of Medical Internet Research
85 papers in training set
Top 2%
1.9%
12
The Lancet Digital Health
25 papers in training set
Top 0.3%
1.9%
13
npj Digital Medicine
97 papers in training set
Top 2%
1.9%
14
eBioMedicine
130 papers in training set
Top 1%
1.7%
15
Frontiers in Medicine
113 papers in training set
Top 4%
1.5%
16
International Journal of Environmental Research and Public Health
124 papers in training set
Top 5%
1.0%
17
Diabetologia
36 papers in training set
Top 0.9%
0.8%
18
PLOS Digital Health
91 papers in training set
Top 3%
0.8%
19
Frontiers in Public Health
140 papers in training set
Top 8%
0.8%
20
Expert Systems with Applications
11 papers in training set
Top 0.5%
0.7%
21
Healthcare
16 papers in training set
Top 2%
0.7%
22
PeerJ
261 papers in training set
Top 17%
0.7%
23
Nature Communications
4913 papers in training set
Top 66%
0.7%
24
Communications Biology
886 papers in training set
Top 28%
0.7%
25
Journal of Biomedical Informatics
45 papers in training set
Top 2%
0.5%
26
PLOS Global Public Health
293 papers in training set
Top 6%
0.5%
27
Science Advances
1098 papers in training set
Top 35%
0.5%