Back

A comparison of the genes and genesets identified by EWAS and GWAS of fourteen complex traits

Battram, T.; Gaunt, T. R.; Relton, C. L.; Timpson, N. J.; Hemani, G.

2022-03-25 epidemiology
10.1101/2022.03.25.22272928 medRxiv
Show abstract

Identifying the genes, properties of these genes and pathways to understand the underlying biology of complex traits responsible for differential health states in the population is a common goal of epigenome-wide and genome-wide association studies (EWAS and GWAS). GWAS identify genetic variants that effect the trait of interest or variants that are in linkage disequilibrium with the true causal variants. EWAS identify variation in DNA methylation, a complex molecular phenotype, associated with the trait of interest. Therefore, while GWAS in principle will only detect variants within or near causal genes, EWAS can also detect genes that confound the association between a trait and a DNA methylation site, or are reverse causal. Here we systematically compare association EWAS and GWAS results of 14 complex traits (N > 4500). A small fraction of detected genomic regions were shared by both EWAS and GWAS (0-9%). We evaluated if the genes or gene ontology terms flagged by GWAS and EWAS overlapped, and after a multiple testing correction, found substantial overlap for diastolic blood pressure (gene overlap P = 5.2x10-6, term overlap P = 0.001). We superimposed our empirical findings against simulated models of varying genetic and epigenetic architectures and observed that in a majority of cases EWAS and GWAS are likely capturing distinct genesets, implying that genes identified by EWAS are not generally causally upstream of the trait. Overall our results indicate that EWAS and GWAS are capturing different aspects of the biology of complex traits.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Genetic Epidemiology
46 papers in training set
Top 0.1%
22.9%
2
PLOS Genetics
756 papers in training set
Top 1%
10.3%
3
Scientific Reports
3102 papers in training set
Top 9%
8.6%
4
Bioinformatics
1061 papers in training set
Top 4%
6.9%
5
Frontiers in Genetics
197 papers in training set
Top 0.7%
6.5%
50% of probability mass above
6
Human Genetics and Genomics Advances
70 papers in training set
Top 0.1%
4.9%
7
PLOS ONE
4510 papers in training set
Top 38%
3.7%
8
Human Molecular Genetics
130 papers in training set
Top 1.0%
2.8%
9
eLife
5422 papers in training set
Top 31%
2.8%
10
PLOS Computational Biology
1633 papers in training set
Top 12%
2.6%
11
The American Journal of Human Genetics
206 papers in training set
Top 2%
1.8%
12
F1000Research
79 papers in training set
Top 2%
1.7%
13
International Journal of Epidemiology
74 papers in training set
Top 1%
1.7%
14
European Journal of Epidemiology
40 papers in training set
Top 0.3%
1.7%
15
BMC Genomics
328 papers in training set
Top 3%
1.4%
16
Epigenetics
43 papers in training set
Top 0.6%
1.2%
17
Bioinformatics Advances
184 papers in training set
Top 4%
1.0%
18
Nature Communications
4913 papers in training set
Top 59%
0.9%
19
Wellcome Open Research
57 papers in training set
Top 2%
0.8%
20
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.8%
21
Epigenomics
10 papers in training set
Top 0.1%
0.8%
22
BMC Bioinformatics
383 papers in training set
Top 7%
0.7%
23
Circulation: Genomic and Precision Medicine
42 papers in training set
Top 1%
0.7%
24
iScience
1063 papers in training set
Top 37%
0.7%
25
npj Genomic Medicine
33 papers in training set
Top 1%
0.7%
26
Human Brain Mapping
295 papers in training set
Top 5%
0.5%
27
PeerJ
261 papers in training set
Top 19%
0.5%