Back

eMAGMA: An eQTL-informed method to identify risk genes using genome-wide association study summary statistics

Gerring, Z. F.; Mina-Vargas, A.; Derks, E. M.

2019-11-25 genetics
10.1101/854315 bioRxiv
Show abstract

Identifying genes underlying genetic associations of complex disease is challenging because most common risk variants reside in non-protein coding regions of the genome and likely alter the expression of target genes by disrupting tissue and cell-type specific regulatory elements. To address this challenge, we developed a methodological framework, eQTL-MAGMA (eMAGMA), that converts SNP-level summary statistics into gene-level association statistics by assigning non-coding SNPs to their putative genes based on tissue-specific eQTL information. We compared eMAGMA to three eQTL informed gene-based approaches--S-PrediXcan, FUSION, and SMR--using simulated phenotype data. Phenotypes were simulated based on eQTL reference data using GCTA for all genes with at least one eQTL at chromosome 1 (651 genes). We performed 10 simulations per gene. The eQTL-h2 (i.e., the proportion of variation explained by the eQTLs was set at 1%, 2%, and 5%. We found eMAGMA outperforms other gene-based approaches across a range of simulated parameters (e.g. the number of identified causal genes). When applied to genome-wide association summary statistics for major depression, eMAGMA identified substantially more putative candidate causal genes compared to other eQTL-based approaches. By integrating tissue-specific eQTL information, these results show eMAGMA will help to identify novel candidate causal genes from genome-wide association summary statistics and thereby improve the understanding of the biological basis of complex disorders.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Genetic Epidemiology
46 papers in training set
Top 0.1%
28.5%
2
Bioinformatics
1061 papers in training set
Top 3%
10.4%
3
The American Journal of Human Genetics
206 papers in training set
Top 0.7%
6.6%
4
Scientific Reports
3102 papers in training set
Top 22%
5.0%
50% of probability mass above
5
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.5%
5.0%
6
Briefings in Bioinformatics
326 papers in training set
Top 2%
3.7%
7
PLOS Genetics
756 papers in training set
Top 5%
3.2%
8
BMC Bioinformatics
383 papers in training set
Top 3%
2.8%
9
PLOS Computational Biology
1633 papers in training set
Top 12%
2.7%
10
Biological Psychiatry
119 papers in training set
Top 1%
2.1%
11
PLOS ONE
4510 papers in training set
Top 47%
2.1%
12
Human Genetics and Genomics Advances
70 papers in training set
Top 0.2%
1.8%
13
Frontiers in Genetics
197 papers in training set
Top 4%
1.7%
14
Nature Communications
4913 papers in training set
Top 50%
1.7%
15
Bioinformatics Advances
184 papers in training set
Top 3%
1.7%
16
Nature Human Behaviour
85 papers in training set
Top 2%
1.7%
17
Human Molecular Genetics
130 papers in training set
Top 2%
1.5%
18
Communications Biology
886 papers in training set
Top 14%
1.3%
19
International Journal of Epidemiology
74 papers in training set
Top 2%
1.3%
20
American Journal of Medical Genetics Part B: Neuropsychiatric Genetics
22 papers in training set
Top 0.3%
1.3%
21
European Journal of Human Genetics
49 papers in training set
Top 0.9%
1.0%
22
Genetics
225 papers in training set
Top 3%
0.9%
23
Translational Psychiatry
219 papers in training set
Top 4%
0.8%
24
Human Brain Mapping
295 papers in training set
Top 4%
0.8%
25
Genetics in Medicine
69 papers in training set
Top 1%
0.5%
26
G3: Genes, Genomes, Genetics
222 papers in training set
Top 1%
0.5%
27
BMC Medical Genomics
36 papers in training set
Top 2%
0.5%