Back

An Optimally Weighted Combination Method to DetectNovel Disease Associated Genes Using Publicly Available GWAS Summary Data

Zhang, J.; Gonzales, S.; Liu, J.; Gao, X. R.; wang, x.

2019-07-20 genetics
10.1101/709808 bioRxiv
Show abstract

Gene-based analyses offer a useful alternative and complement to the usual single nucleotide polymorphism (SNP) based analysis for genome-wide association studies (GWASs). Using appropriate weights (pre-specified or eQTL-derived) can boost statistical power, especially for detecting weak associations between a gene and a trait. Because the sparsity level or association directions of the underlying association patterns in real data are often unknown and access to individual-level data is limited, we propose an optimal weighted combination (OWC) test applicable to summary statistics from GWAS. This method includes burden tests, weighted sum of squared score (SSU), weighted sum statistic (WSS), and the score test as its special cases. We analytically prove that aggregating the variants in one gene is the same as using the weighted combination of Z-scores for each variant based on the score test method. We also numerically illustrate that our proposed test outperforms several existing comparable methods via simulation studies. Lastly, we utilize schizophrenia GWAS data and a fasting glucose GWAS meta-analysis data to demonstrate that our method outperforms the existing methods in real data analyses. Our proposed test is implemented in the R program OWC, which is freely and publicly available.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Genetic Epidemiology
46 papers in training set
Top 0.1%
26.0%
2
Bioinformatics
1061 papers in training set
Top 1%
18.7%
3
Biometrics
22 papers in training set
Top 0.1%
6.4%
50% of probability mass above
4
PLOS Genetics
756 papers in training set
Top 3%
4.9%
5
Human Brain Mapping
295 papers in training set
Top 2%
3.6%
6
PLOS ONE
4510 papers in training set
Top 39%
3.6%
7
Genetics
225 papers in training set
Top 1%
3.1%
8
PLOS Computational Biology
1633 papers in training set
Top 11%
3.1%
9
Scientific Reports
3102 papers in training set
Top 47%
2.5%
10
Briefings in Bioinformatics
326 papers in training set
Top 3%
2.4%
11
The American Journal of Human Genetics
206 papers in training set
Top 2%
2.1%
12
Statistics in Medicine
34 papers in training set
Top 0.1%
1.9%
13
BMC Bioinformatics
383 papers in training set
Top 4%
1.8%
14
Physical Review Research
46 papers in training set
Top 0.3%
1.7%
15
NAR Genomics and Bioinformatics
214 papers in training set
Top 2%
1.5%
16
Journal of the American Medical Informatics Association
61 papers in training set
Top 2%
1.2%
17
Frontiers in Genetics
197 papers in training set
Top 7%
1.1%
18
iScience
1063 papers in training set
Top 29%
0.8%
19
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 42%
0.8%
20
Nature Communications
4913 papers in training set
Top 62%
0.7%
21
Human Genetics and Genomics Advances
70 papers in training set
Top 0.8%
0.7%
22
The Annals of Applied Statistics
15 papers in training set
Top 0.1%
0.7%
23
International Journal of Epidemiology
74 papers in training set
Top 3%
0.7%
24
G3 Genes|Genomes|Genetics
351 papers in training set
Top 3%
0.6%
25
Human Molecular Genetics
130 papers in training set
Top 4%
0.6%
26
NeuroImage
813 papers in training set
Top 6%
0.6%
27
Communications Biology
886 papers in training set
Top 32%
0.5%