Back

Comparing ancestry calibration approaches for a trans-ancestry colorectal cancer polygenic risk score

Rosenthal, E. A.; Hsu, L.; Thomas, M.; Peters, U.; Kachulis, C.; Patterson, K.; jarvik, G. P.

2023-10-24 genetic and genomic medicine
10.1101/2023.10.23.23296753
Show abstract

BackgroundColorectal cancer (CRC) is a complex disease with monogenic, polygenic and environmental risk factors. Polygenic risk scores (PRS) are being developed to identify high polygenic risk individuals. Due to differences in genetic background, PRS distributions vary by ancestry, necessitating calibration. MethodsWe compared four calibration methods using the All of Us Research Program Whole Genome Sequence data for a CRC PRS previously developed in participants of European and East Asian ancestry. The methods contrasted results from linear models with A) the entire data set or an ancestrally diverse training set AND B) covariates including principal components of ancestry or admixture. Calibration with the training set adjusted the variance in addition to the mean. ResultsAll methods performed similarly within ancestry with OR (95% C.I.) per s.d. change in PRS: African 1.5 (1.02, 2.08), Admixed American 2.2 (1.27, 3.85), European 1.6 (1.43, 1.89), and Middle Eastern 1.1 (0.71, 1.63). Using admixture and an ancestrally diverse training set provided distributions closest to standard Normal with accurate upper tail frequencies. ConclusionAlthough the PRS is predictive of CRC risk for most ancestries, its performance varies by ancestry. Post-hoc calibration preserves the risk prediction within ancestries. Training a calibration model on ancestrally diverse participants to adjust both the mean and variance of the PRS, using admixture as covariates, created standard Normal z-scores. These z-scores can be used to identify patients at high polygenic risk, and can be incorporated into comprehensive risk scores including other known risk factors, allowing for more precise risk estimates.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
International Journal of Cancer
based on 18 papers
Top 0.1%
12.1%
2
Cancer Epidemiology, Biomarkers & Prevention
based on 14 papers
Top 0.1%
9.7%
3
The American Journal of Human Genetics
based on 77 papers
Top 2%
9.7%
4
Genetics in Medicine
based on 57 papers
Top 2%
4.8%
5
Cancers
based on 57 papers
Top 4%
4.3%
6
eLife
based on 262 papers
Top 6%
4.3%
7
Scientific Reports
based on 701 papers
Top 45%
4.3%
8
Nature Communications
based on 483 papers
Top 23%
2.7%
50% of probability mass above
9
Genome Medicine
based on 56 papers
Top 4%
2.3%
10
PLOS ONE
based on 1737 papers
Top 86%
2.2%
11
JAMA Network Open
based on 125 papers
Top 11%
1.5%
12
European Journal of Human Genetics
based on 25 papers
Top 1%
1.5%
13
Gut
based on 17 papers
Top 2%
1.3%
14
PLOS Genetics
based on 39 papers
Top 4%
1.3%
15
International Journal of Epidemiology
based on 65 papers
Top 6%
1.3%
16
Human Molecular Genetics
based on 28 papers
Top 3%
1.3%
17
Genes
based on 21 papers
Top 2%
1.2%
18
Human Genetics and Genomics Advances
based on 39 papers
Top 3%
1.1%
19
Nature Genetics
based on 72 papers
Top 8%
1.1%
20
Journal of Medical Genetics
based on 22 papers
Top 2%
1.1%
21
The Journal of Molecular Diagnostics
based on 24 papers
Top 2%
0.8%
22
Human Genomics
based on 13 papers
Top 2%
0.8%
23
JNCI: Journal of the National Cancer Institute
based on 13 papers
Top 2%
0.8%
24
Human Mutation
based on 14 papers
Top 2%
0.8%
25
British Journal of Cancer
based on 22 papers
Top 4%
0.8%
26
Breast Cancer Research
based on 11 papers
Top 2%
0.6%
27
Nature Medicine
based on 88 papers
Top 20%
0.6%
28
Nature
based on 58 papers
Top 11%
0.6%
29
BMC Medical Genomics
based on 12 papers
Top 2%
0.6%