Back

Pathway Polygenic Risk Scores (pPRS) for the Analysis of Gene-environment Interaction

Gauderman, W. J.; Fu, Y.; Quem, B.; Kawaguchi, E.; Wang, Y.; Morrison, J.; Brenner, H.; Chan, A.; Gruber, S.; Temitope, K.; Li, L.; Moreno, V.; Pellatt, A.; Peters, U.; Samadder, N. J.; Schmit, S.; Ulrich, C.; Um, C.; Wu, A.; Lewinger, J. P.; Mi, H.; Drew, D.

2024-12-20 genetics
10.1101/2024.12.16.628610 bioRxiv
Show abstract

A polygenic risk score (PRS) is used to quantify the combined disease risk of many genetic variants. For complex human traits there is interest in determining whether the PRS modifies, i.e. interacts with, important environmental (E) risk factors. Detection of a PRS by environment (PRS x E) interaction may provide clues to underlying biology and can be useful in developing targeted prevention strategies for modifiable risk factors. The standard PRS may include a subset of variants that interact with E but a much larger subset of variants that affect disease without regard to E. This latter subset will water down the underlying signal in former subset, leading to reduced power to detect PRS x E interaction. We explore the use of pathway-defined PRS (pPRS) scores, using state of the art tools to annotate subsets of variants to genomic pathways. We demonstrate via simulation that testing targeted pPRS x E interaction can yield substantially greater power than testing overall PRS x E interaction. We also analyze a large study (N=78,253) of colorectal cancer (CRC) where E = non-steroidal anti-inflammatory drugs (NSAIDs), a well-established protective exposure. While no evidence of overall PRS x NSAIDs interaction (p=0.41) is observed, a significant pPRS x NSAIDs interaction (p=0.0003) is identified based on SNPs within the TGF-{beta} / gonadotropin releasing hormone receptor (GRHR) pathway. NSAIDS is protective (OR=0.84) for those at the 5th percentile of the TGF-{beta}/GRHR pPRS (low genetic risk, OR), but significantly more protective (OR=0.70) for those at the 95th percentile (high genetic risk). From a biological perspective, this suggests that NSAIDs may act to reduce CRC risk specifically through genes in these pathways. From a population health perspective, our result suggests that focusing on genes within these pathways may be effective at identifying those for whom NSAIDs-based CRC-prevention efforts may be most effective. Author SummaryThe identification of polygenic risk score (PRS) by environment (PRSxE) interactions may provide clues to underlying biology and facilitate targeted disease prevention strategies. The standard approach to computing a PRS likely includes many variants that affect disease without regard to E, reducing power to detect PRS x E interactions. We utilize gene annotation tools to develop pathway-based PRS (pPRS) scores and show by simulation studies that testing pPRS x E interaction can yield substantially greater power than testing PRS x E, while also integrating biological knowledge into the analysis. We apply our method to a large study of colorectal cancer to identify a significant pPRS x NSAIDs interaction (p=0.0003) based on SNPs within the TGF-{beta} / gonadotropin releasing hormone receptor (GRHR) pathway. Our findings suggest that focusing on genetic susceptibility within biologically informed pathways may be more sensitive for identifying exposures that can be considered as part of a precision prevention approach.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Genetic Epidemiology
46 papers in training set
Top 0.1%
25.5%
2
G3 Genes|Genomes|Genetics
351 papers in training set
Top 0.2%
8.3%
3
BMC Bioinformatics
383 papers in training set
Top 2%
6.3%
4
Frontiers in Genetics
197 papers in training set
Top 0.9%
6.2%
5
PLOS Computational Biology
1633 papers in training set
Top 6%
6.2%
50% of probability mass above
6
Bioinformatics
1061 papers in training set
Top 4%
6.2%
7
PLOS ONE
4510 papers in training set
Top 34%
4.2%
8
Scientific Reports
3102 papers in training set
Top 38%
3.5%
9
GENETICS
189 papers in training set
Top 0.3%
3.5%
10
PLOS Genetics
756 papers in training set
Top 6%
2.3%
11
The American Journal of Human Genetics
206 papers in training set
Top 2%
1.7%
12
Genetics
225 papers in training set
Top 2%
1.6%
13
International Journal of Epidemiology
74 papers in training set
Top 2%
1.5%
14
BMC Cancer
52 papers in training set
Top 2%
1.3%
15
European Journal of Human Genetics
49 papers in training set
Top 1%
0.9%
16
Physiological Reports
35 papers in training set
Top 1%
0.8%
17
Physiological Genomics
15 papers in training set
Top 0.3%
0.8%
18
Physical Biology
43 papers in training set
Top 2%
0.8%
19
G3: Genes, Genomes, Genetics
222 papers in training set
Top 1%
0.7%
20
Statistics in Medicine
34 papers in training set
Top 0.4%
0.7%
21
Clinical Pharmacology & Therapeutics
25 papers in training set
Top 0.8%
0.7%
22
BioData Mining
15 papers in training set
Top 1%
0.7%
23
BMC Genomics
328 papers in training set
Top 7%
0.6%
24
Journal of Medical Genetics
28 papers in training set
Top 0.7%
0.6%
25
Journal of The Royal Society Interface
189 papers in training set
Top 5%
0.6%
26
Genes
126 papers in training set
Top 4%
0.6%
27
Archives of Clinical and Biomedical Research
28 papers in training set
Top 3%
0.6%