Back

Within-Family GWAS does not Ameliorate the Decline in Prediction Accuracy across Populations

Zhang, L.; Conley, D.

2024-12-13 genomics
10.1101/2024.12.12.628188 bioRxiv
Show abstract

As polygenic prediction extends beyond the research domain to involve clinical applications, the urgency of solving the "portability problem" becomes amplified--that is, the fact that polygenic indices (PGI) constructed based on discovery analysis in one population (typically of exclusively continental European descent) predict poorly in other populations. In the present paper we test whether population differences in genetic nurture, assortative mating, or population stratification contribute to the fact that polygenic indices constructed based on GWAS results from European-descent samples predict more poorly in admixed populations with Native American and African ancestry. We do this by comparing the rates of decline in prediction accuracy of classical-GWAS-based PGIs versus within-family-based PGIs, each estimated in a population of European descent, as they are deployed in two samples of Latino Americans and African Americans. Within-family GWAS putatively eliminates the effects of parental genetic nurture, assortative mating, and population stratification; thus, we can determine whether without those confounding factors in the PGI construction, the relative prediction accuracy in the out-groups is ameliorated. Results show that relative prediction accuracy is not improved, suggesting that the differences across groups can be almost entirely explained by variation in genetic architecture (i.e. allele frequencies and short-range LD) rather than the aforementioned factors. Additional analysis of the impact of genetic architecture on the decline in prediction accuracy supports this conclusion. Future researchers should test within-family analysis at the prediction rather than the discovery stage.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Frontiers in Genetics
197 papers in training set
Top 0.1%
28.2%
2
Scientific Reports
3102 papers in training set
Top 12%
7.3%
3
Genetic Epidemiology
46 papers in training set
Top 0.1%
6.5%
4
PLOS ONE
4510 papers in training set
Top 30%
4.9%
5
European Journal of Human Genetics
49 papers in training set
Top 0.2%
4.9%
50% of probability mass above
6
Behavior Genetics
15 papers in training set
Top 0.1%
3.7%
7
PLOS Genetics
756 papers in training set
Top 4%
3.7%
8
Human Genetics and Genomics Advances
70 papers in training set
Top 0.1%
2.8%
9
BMC Genomics
328 papers in training set
Top 1%
2.7%
10
Bioinformatics
1061 papers in training set
Top 6%
2.1%
11
Genes
126 papers in training set
Top 0.6%
2.1%
12
GENETICS
189 papers in training set
Top 0.5%
1.8%
13
Human Molecular Genetics
130 papers in training set
Top 1%
1.8%
14
Genome Biology and Evolution
280 papers in training set
Top 1.0%
1.7%
15
BMC Bioinformatics
383 papers in training set
Top 5%
1.5%
16
PLOS Computational Biology
1633 papers in training set
Top 19%
1.2%
17
Peer Community Journal
254 papers in training set
Top 3%
1.1%
18
Evolutionary Applications
91 papers in training set
Top 1.0%
0.9%
19
Bioinformatics Advances
184 papers in training set
Top 4%
0.9%
20
Human Genomics
21 papers in training set
Top 0.3%
0.9%
21
Briefings in Bioinformatics
326 papers in training set
Top 6%
0.9%
22
Journal of Personalized Medicine
28 papers in training set
Top 0.9%
0.9%
23
Heredity
53 papers in training set
Top 0.3%
0.8%
24
PeerJ
261 papers in training set
Top 15%
0.8%
25
Computational and Structural Biotechnology Journal
216 papers in training set
Top 9%
0.8%
26
Genetics
225 papers in training set
Top 5%
0.5%
27
F1000Research
79 papers in training set
Top 6%
0.5%
28
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 12%
0.5%
29
Cell Genomics
162 papers in training set
Top 8%
0.5%
30
Gene
41 papers in training set
Top 3%
0.5%