Back

Identifying and ranking novel independent features for cardiovascular disease prediction in people with type 2 diabetes

Dziopa, K.; Chaturvedi, N.; Asselbergs, F.; Schmidt, A. F.

2023-10-23 epidemiology
10.1101/2023.10.23.23297398 medRxiv
Show abstract

BackgroundCVD prediction models do not perform well in people with diabetes. We therefore aimed to identify novel predictors for six facets of CVD, (including coronary heart disease (CHD), Ischemic stroke, heart failure (HF), and atrial fibrillation (AF)) in people with T2DM. MethodsAnalyses were conducted using the UK biobank and were stratified on history of CVD and of T2DM: 459,142 participants without diabetes or a history of CVD, 14,610 with diabetes but without CVD, and 4,432 with diabetes and a history of CVD. Replication was performed using a 20% hold-out set, ranking features on their permuted c-statistic. ResultsOut of the 600+ candidate features, we identified a subset of replicated features, ranging between 32 for CHD in people with diabetes to 184 for CVD+HF+AF in people without diabetes. Classical CVD risk factors (e.g. parental or maternal history of heart disease, or blood pressure) were relatively highly ranked for people without diabetes. The top predictors in the people with diabetes without a CVD history included: cystatin C, self-reported health satisfaction, biochemical measures of ill health (e.g. plasma albumin). For people with diabetes and a history of CVD top features were: self-reported ill health, and blood cell counts measurements (e.g. red cell distribution width). We additionally identified risk factors unique to people with diabetes, consisting of information on dietary patterns, mental health and biochemistry measures. Consideration of these novel features improved risk classification, for example per 1000 people with diabetes 133 CVD and 165 HF cases appropriately received a higher risk. ConclusionThrough data-driven feature selection we identified a substantial number of features relevant for prediction of cardiovascular risk in people with diabetes, the majority of which related to non-classical risk factors such as mental health, general illness markers, and kidney disease.

Matching journals

The top 2 journals account for 50% of the predicted probability mass.

1
Diabetologia
36 papers in training set
Top 0.1%
33.9%
2
BMJ Open Diabetes Research & Care
15 papers in training set
Top 0.1%
19.2%
50% of probability mass above
3
Diabetes Care
12 papers in training set
Top 0.1%
7.0%
4
International Journal of Epidemiology
74 papers in training set
Top 0.3%
6.5%
5
Diabetes, Obesity and Metabolism
17 papers in training set
Top 0.1%
3.7%
6
The Journal of Clinical Endocrinology & Metabolism
35 papers in training set
Top 0.5%
2.4%
7
Frontiers in Endocrinology
53 papers in training set
Top 0.8%
2.2%
8
eBioMedicine
130 papers in training set
Top 0.9%
1.9%
9
Scientific Reports
3102 papers in training set
Top 54%
1.8%
10
Nature Communications
4913 papers in training set
Top 54%
1.4%
11
PLOS Medicine
98 papers in training set
Top 3%
1.3%
12
Wellcome Open Research
57 papers in training set
Top 2%
0.9%
13
BMC Medicine
163 papers in training set
Top 6%
0.8%
14
BMC Infectious Diseases
118 papers in training set
Top 5%
0.8%
15
Diabetes
53 papers in training set
Top 0.6%
0.8%
16
The Lancet Regional Health - Europe
32 papers in training set
Top 0.4%
0.8%
17
Journal of Clinical Medicine
91 papers in training set
Top 6%
0.8%
18
European Journal of Public Health
20 papers in training set
Top 1%
0.8%
19
British Journal of General Practice
22 papers in training set
Top 0.5%
0.8%
20
PLOS ONE
4510 papers in training set
Top 68%
0.7%
21
Genome Medicine
154 papers in training set
Top 8%
0.7%
22
JAMIA Open
37 papers in training set
Top 2%
0.7%
23
European Journal of Preventive Cardiology
13 papers in training set
Top 1%
0.5%
24
Communications Medicine
85 papers in training set
Top 2%
0.5%