Back

Low-pass Whole Genome Imputation Enables the Characterization of Polygenic Breast Cancer Risk in the Indigenous Arab Population

Al-Jumaan, M.; Chu, H.; Al-Sulaiman, A.; Camp, S. Y.; Han, S.; Gillani, R.; Al Marzooq, Y.; Almulhim, F.; Vatte, C.; Al Nemer, A.; Almuhanna, A.; Van Allen, E. M.; Al-Ali, A.; AlDubayan, S. H.

2022-12-09 genetic and genomic medicine
10.1101/2022.12.07.22282785 medRxiv
Show abstract

The indigenous Arab population has traditionally been underrepresented in cancer genomics studies, and as a result the polygenic risk landscape of breast cancer in the population remains elusive. Here we show by utilizing low-pass whole genome sequencing (lpWGS), we can accurately impute population-specific variants with high exome concordance (median dosage correlation: 0.9459, Interquartile range: 0.9410-0.9490) and construct breast cancer burden-sensitive polygenic risk scores (PRS) using publicly available resources. After adjusting the PRS to the Arab population, we found significant associations between PRS performance in risk prediction and first-degree relative breast cancer history prediction (Spearman rho=0.43, p = 0.03), where breast cancer patients in the top PRS decile are 5.53 (95% CI: 1.76-17.97, p = 0.003) times more likely to also have a first degree relative diagnosed with breast cancer compared to those in the middle deciles. In addition, we found evidence for the genetic liability threshold model of breast cancer where among patients with a family history of breast cancer, pathogenic rare variant carriers had significantly lower PRS than non-carriers (p = 0.0205, M.W.U.) while for non-carriers every standard deviation increase in PRS corresponded to 4.52 years (95% CI: 8.88-0.17, p = 0.042) earlier age of presentation. Overall, our study provides a viable strategy utilizing lpWGS to assess polygenic risk in an understudied population and took steps in addressing existing global health disparities.

Matching journals

The top 2 journals account for 50% of the predicted probability mass.

1
Cell Genomics
162 papers in training set
Top 0.1%
44.7%
2
Cell
370 papers in training set
Top 2%
6.8%
50% of probability mass above
3
Genome Medicine
154 papers in training set
Top 1%
5.2%
4
Nature Genetics
240 papers in training set
Top 1%
5.2%
5
Nature Communications
4913 papers in training set
Top 34%
4.7%
6
Science Advances
1098 papers in training set
Top 12%
2.2%
7
Developmental Cell
168 papers in training set
Top 7%
1.9%
8
The American Journal of Human Genetics
206 papers in training set
Top 2%
1.8%
9
iScience
1063 papers in training set
Top 13%
1.8%
10
Scientific Reports
3102 papers in training set
Top 61%
1.6%
11
Frontiers in Genetics
197 papers in training set
Top 5%
1.6%
12
Cell Reports
1338 papers in training set
Top 26%
1.4%
13
Nucleic Acids Research
1128 papers in training set
Top 13%
1.3%
14
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 39%
1.0%
15
Cell Systems
167 papers in training set
Top 10%
1.0%
16
eLife
5422 papers in training set
Top 51%
1.0%
17
Nature Medicine
117 papers in training set
Top 4%
0.8%
18
Cell Death & Disease
126 papers in training set
Top 2%
0.8%
19
Molecular Systems Biology
142 papers in training set
Top 1%
0.8%
20
BMC Medical Genomics
36 papers in training set
Top 1%
0.8%
21
PLOS ONE
4510 papers in training set
Top 65%
0.8%
22
Nature
575 papers in training set
Top 15%
0.8%
23
Molecular Cancer Research
42 papers in training set
Top 0.7%
0.8%
24
Molecular Cell
308 papers in training set
Top 10%
0.8%
25
Clinical Cancer Research
58 papers in training set
Top 2%
0.8%
26
BMC Bioinformatics
383 papers in training set
Top 7%
0.8%
27
Human Mutation
29 papers in training set
Top 0.7%
0.8%
28
Nature Human Behaviour
85 papers in training set
Top 5%
0.7%
29
Human Genetics and Genomics Advances
70 papers in training set
Top 1%
0.5%