Back

The accuracy of polygenic score models for anthropometric traits and Type II Diabetes in the Native Hawaiian Population

Lo, Y.-C.; Chan, T. F.; Jeon, S.; Maskarinec, G.; Taparra, K.; Nakatsuka, N.; Yu, M.; Chen, C.-Y.; Lin, Y.-F.; Wilkens, L. R.; Le Marchand, L.; Haiman, C. A.; Chiang, C. W. K.

2023-12-28 genetic and genomic medicine
10.1101/2023.12.25.23300499 medRxiv
Show abstract

Polygenic scores (PGS) are promising in stratifying individuals based on the genetic susceptibility to complex diseases or traits. However, the accuracy of PGS models, typically trained in European- or East Asian-ancestry populations, tend to perform poorly in other ethnic minority populations, and their accuracies have not been evaluated for Native Hawaiians. Using body mass index, height, and type-2 diabetes as examples of highly polygenic traits, we evaluated the prediction accuracies of PGS models in a large Native Hawaiian sample from the Multiethnic Cohort with up to 5,300 individuals. We evaluated both publicly available PGS models or genome-wide PGS models trained in this study using the largest available GWAS. We found evidence of lowered prediction accuracies for the PGS models in some cases, particularly for height. We also found that using the Native Hawaiian samples as an optimization cohort during training did not consistently improve PGS performance. Moreover, even the best performing PGS models among Native Hawaiians would have lowered prediction accuracy among the subset of individuals most enriched with Polynesian ancestry. Our findings indicate that factors such as admixture histories, sample size and diversity in GWAS can influence PGS performance for complex traits among Native Hawaiian samples. This study provides an initial survey of PGS performance among Native Hawaiians and exposes the current gaps and challenges associated with improving polygenic prediction models for underrepresented minority populations.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Frontiers in Genetics
197 papers in training set
Top 0.1%
22.6%
2
Human Molecular Genetics
130 papers in training set
Top 0.1%
10.5%
3
Human Genetics and Genomics Advances
70 papers in training set
Top 0.1%
10.1%
4
Cell Genomics
162 papers in training set
Top 0.5%
6.3%
5
Genetic Epidemiology
46 papers in training set
Top 0.1%
4.9%
50% of probability mass above
6
Scientific Reports
3102 papers in training set
Top 23%
4.9%
7
BMC Genomics
328 papers in training set
Top 1%
2.9%
8
European Journal of Human Genetics
49 papers in training set
Top 0.4%
2.6%
9
International Journal of Epidemiology
74 papers in training set
Top 0.9%
2.5%
10
PLOS ONE
4510 papers in training set
Top 46%
2.5%
11
Communications Biology
886 papers in training set
Top 7%
1.8%
12
Human Genetics
25 papers in training set
Top 0.2%
1.5%
13
Genome Medicine
154 papers in training set
Top 5%
1.3%
14
Human Genomics
21 papers in training set
Top 0.2%
1.3%
15
PLOS Computational Biology
1633 papers in training set
Top 18%
1.3%
16
PLOS Genetics
756 papers in training set
Top 10%
1.3%
17
BMC Medical Genomics
36 papers in training set
Top 0.6%
1.3%
18
GENETICS
189 papers in training set
Top 0.8%
1.3%
19
Briefings in Bioinformatics
326 papers in training set
Top 5%
1.0%
20
Bioinformatics
1061 papers in training set
Top 9%
0.9%
21
Bioinformatics Advances
184 papers in training set
Top 4%
0.9%
22
Journal of the American Medical Informatics Association
61 papers in training set
Top 2%
0.9%
23
BioData Mining
15 papers in training set
Top 0.7%
0.9%
24
eLife
5422 papers in training set
Top 55%
0.8%
25
Journal of Translational Medicine
46 papers in training set
Top 2%
0.8%
26
Nature Communications
4913 papers in training set
Top 63%
0.7%
27
iScience
1063 papers in training set
Top 34%
0.7%
28
npj Genomic Medicine
33 papers in training set
Top 1%
0.6%
29
Frontiers in Human Neuroscience
67 papers in training set
Top 3%
0.5%