Prediction of Cardiovascular and Renal Complications of Diabetes by a multi-Polygenic Risk Score in Different Ethnic Groups

Kodji, E.; Attaoua, R.; Haloui, M.; Hishmih, C.; Seitz, M.; Hamet, P.; Hussin, J.; Tremblay, J.

2025-06-18 genetic and genomic medicine
10.1101/2025.06.17.25329804 medRxiv

We developed a multi-polygenic risk score (multiPRS) to predict the risk of nephropathy, stroke, and myocardial infarction in people of European descent with type 2 diabetes. The underrepresentation of non-European populations remains a major challenge in genomics research. Objective: To evaluate whether our multiPRS model accurately predicts these complications in patients of African and South Asian descent. Method: The multiPRS was developed using 4,098 participants of European origin with type 2 diabetes from the ADVANCE trial. Its predictive performance was tested on 17,574 White British, 1,145 South Asian, and 749 African participants with type 2 diabetes from the UK Biobank using several machine learning prediction models, including techniques tailored for imbalanced datasets. Results: Overall, linear discriminant analysis and logistic regression performed best at predicting the risk of nephropathy, stroke, and myocardial infarction across the three ethnic groups. Adding the Mondrian Cross-Conformal Prediction method to logistic regression improved AUROC values and case detection, particularly in South Asian and African participants, whereas in White British participants performance varied by phenotype. Conclusion: Logistic regression, used as the underlying model within the Mondrian Cross-Conformal Prediction framework, improved the prediction of diabetes complications while attaching a confidence level to the predictions, and allowed better translation of a multiPRS derived from European populations to other ethnic groups.
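
The abstract names Mondrian Cross-Conformal Prediction with logistic regression as the underlying model but does not describe the procedure. The sketch below is one generic way to implement that idea, not the authors' pipeline. It assumes a single hypothetical feature (standing in for a risk score), synthetic imbalanced data, a hand-written gradient-descent logistic regression, and "one minus the predicted probability of the label" as the nonconformity score. All function names are illustrative.

```python
import math
import random

def fit_logistic(xs, ys, lr=0.1, epochs=500):
    """Fit a one-feature logistic regression by batch gradient descent."""
    w, b, n = 0.0, 0.0, len(xs)
    for _ in range(epochs):
        gw = gb = 0.0
        for x, y in zip(xs, ys):
            p = 1.0 / (1.0 + math.exp(-(w * x + b)))
            gw += (p - y) * x
            gb += p - y
        w -= lr * gw / n
        b -= lr * gb / n
    return w, b

def nonconformity(label, x, model):
    """1 - predicted probability of `label`; larger means stranger."""
    w, b = model
    p1 = 1.0 / (1.0 + math.exp(-(w * x + b)))
    return 1.0 - (p1 if label == 1 else 1.0 - p1)

def mondrian_cross_conformal(xs, ys, x_new, k=5, seed=0):
    """Return {label: p-value} for x_new, pooling calibration over k folds."""
    idx = list(range(len(xs)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    at_least = {0: 0, 1: 0}   # calibration scores >= the new point's score
    n_cal = {0: 0, 1: 0}      # calibration points seen, per class
    for fold in folds:
        held = set(fold)
        train = [i for i in idx if i not in held]
        model = fit_logistic([xs[i] for i in train], [ys[i] for i in train])
        for label in (0, 1):
            a_new = nonconformity(label, x_new, model)
            for i in fold:
                if ys[i] != label:
                    continue  # Mondrian step: compare within the same class only
                n_cal[label] += 1
                if nonconformity(ys[i], xs[i], model) >= a_new:
                    at_least[label] += 1
    return {c: (at_least[c] + 1) / (n_cal[c] + 1) for c in (0, 1)}

def prediction_set(p_values, confidence=0.8):
    """Keep every label whose p-value exceeds the significance level."""
    return {c for c, p in p_values.items() if p > 1.0 - confidence}

# Synthetic, imbalanced toy data (hypothetical): 40 cases, 160 controls.
rng = random.Random(42)
xs = [rng.gauss(1.5, 1.0) for _ in range(40)] + [rng.gauss(0.0, 1.0) for _ in range(160)]
ys = [1] * 40 + [0] * 160

p_vals = mondrian_cross_conformal(xs, ys, x_new=3.0)
print(p_vals, prediction_set(p_vals, confidence=0.8))
```

Because each class is calibrated only against its own members, the minority class (cases) gets its own p-value scale, which is the usual motivation for the Mondrian variant on imbalanced data.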

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1. Diabetologia | 36 papers in training set | Top 0.1% | 33.7%
2. Scientific Reports | 3102 papers in training set | Top 17% | 6.5%
3. International Journal of Epidemiology | 74 papers in training set | Top 0.3% | 5.0%
4. Frontiers in Genetics | 197 papers in training set | Top 2% | 3.7%
5. Diabetes Care | 12 papers in training set | Top 0.1% | 3.7%

50% of probability mass above

6. Human Molecular Genetics | 130 papers in training set | Top 1% | 2.4%
7. PLOS ONE | 4510 papers in training set | Top 47% | 2.1%
8. Nature Communications | 4913 papers in training set | Top 48% | 1.9%
9. Journal of Translational Medicine | 46 papers in training set | Top 0.6% | 1.9%
10. Genome Medicine | 154 papers in training set | Top 4% | 1.7%
11. eBioMedicine | 130 papers in training set | Top 1% | 1.7%
12. Bioinformatics | 1061 papers in training set | Top 7% | 1.7%
13. Bioinformatics Advances | 184 papers in training set | Top 3% | 1.7%
14. The Journal of Clinical Endocrinology & Metabolism | 35 papers in training set | Top 0.8% | 1.5%
15. Journal of Clinical Medicine | 91 papers in training set | Top 4% | 1.5%
16. Genetic Epidemiology | 46 papers in training set | Top 0.5% | 1.4%
17. BMC Medical Genomics | 36 papers in training set | Top 1.0% | 0.9%
18. Frontiers in Pharmacology | 100 papers in training set | Top 4% | 0.9%
19. Trials | 25 papers in training set | Top 1% | 0.8%
20. BMJ Open Diabetes Research & Care | 15 papers in training set | Top 0.9% | 0.8%
21. BMC Medical Informatics and Decision Making | 39 papers in training set | Top 2% | 0.8%
22. eClinicalMedicine | 55 papers in training set | Top 2% | 0.8%
23. JAMIA Open | 37 papers in training set | Top 1% | 0.8%
24. Human Mutation | 29 papers in training set | Top 0.7% | 0.7%
25. International Journal of Environmental Research and Public Health | 124 papers in training set | Top 7% | 0.7%
26. PLOS Digital Health | 91 papers in training set | Top 3% | 0.7%
27. JMIR Medical Informatics | 17 papers in training set | Top 2% | 0.7%
28. Frontiers in Molecular Biosciences | 100 papers in training set | Top 5% | 0.7%
29. British Journal of General Practice | 22 papers in training set | Top 0.6% | 0.7%
30. Clinical and Translational Science | 21 papers in training set | Top 1% | 0.7%