Back

Explicitly modeling genetic ancestry to improve polygenic prediction accuracy for height in a large, admixed cohort of US Latinos: Findings from HCHS/SOL

Wang, X.; Sofer, T.; Frei, O.; Kaplan, R.; Perreira, K. M.; Franceschini, N.; Parada, H.; Zhou, L.; Andreassen, O. A.; Gonzalez, H.; Dale, A. M.; Broce, I. J.

2025-03-23 genetic and genomic medicine
10.1101/2025.03.21.25324423 medRxiv
Show abstract

Polygenic scores (PGS) offer moderate to high prediction accuracy for complex traits, but most are developed in European ancestry cohorts, reducing their performance in populations of other ancestries. This study aimed to improve standing height prediction, a heritable and ancestry-influenced trait, in an admixed Latino cohort (HCHS/SOL) by modeling ancestry using principal components (PCs) alongside PGS. SNPs were selected from a large European ancestry GWAS using various p-value thresholds, and weights were trained using traditional and penalized regression in the UK Biobank (UKB). PGS with PCs were trained separately in HCHS/SOL and UKB. Compared to PGS alone, modeling PGS with PCs substantially improved height prediction in HCHS/SOL (R{superscript 2} increase of [~]0.1), while mild improvements were observed in UKB (R{superscript 2} increase of [~]0.01). These results underscore the importance of incorporating genetic ancestry into predictive models for admixed populations, particularly when the trait exhibits ancestry-specific associations.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Frontiers in Genetics
197 papers in training set
Top 0.1%
23.1%
2
Human Genetics and Genomics Advances
70 papers in training set
Top 0.1%
12.8%
3
Scientific Reports
3102 papers in training set
Top 26%
4.4%
4
Human Molecular Genetics
130 papers in training set
Top 0.5%
4.4%
5
PLOS Genetics
756 papers in training set
Top 3%
4.3%
6
European Journal of Human Genetics
49 papers in training set
Top 0.2%
4.3%
50% of probability mass above
7
Cell Genomics
162 papers in training set
Top 1%
4.1%
8
GENETICS
189 papers in training set
Top 0.2%
3.7%
9
The American Journal of Human Genetics
206 papers in training set
Top 1%
2.9%
10
Genes
126 papers in training set
Top 0.4%
2.8%
11
PLOS ONE
4510 papers in training set
Top 44%
2.7%
12
Genetic Epidemiology
46 papers in training set
Top 0.3%
2.1%
13
Nature Communications
4913 papers in training set
Top 50%
1.7%
14
Journal of Clinical Medicine
91 papers in training set
Top 4%
1.5%
15
Nature Human Behaviour
85 papers in training set
Top 3%
1.4%
16
Behavior Genetics
15 papers in training set
Top 0.1%
1.3%
17
Developmental Cognitive Neuroscience
81 papers in training set
Top 0.4%
1.1%
18
Journal of Medical Genetics
28 papers in training set
Top 0.4%
1.1%
19
International Journal of Epidemiology
74 papers in training set
Top 2%
0.9%
20
eLife
5422 papers in training set
Top 55%
0.8%
21
Aging
69 papers in training set
Top 3%
0.8%
22
Communications Biology
886 papers in training set
Top 23%
0.8%
23
Evolution
199 papers in training set
Top 2%
0.7%
24
Human Genetics
25 papers in training set
Top 0.5%
0.7%
25
Peer Community Journal
254 papers in training set
Top 4%
0.7%
26
Nature Genetics
240 papers in training set
Top 8%
0.7%
27
Clinical and Translational Medicine
30 papers in training set
Top 1%
0.7%
28
Medicine & Science in Sports & Exercise
15 papers in training set
Top 0.5%
0.7%
29
Human Genomics
21 papers in training set
Top 0.5%
0.5%