Back

A Comparative Assessment of edgeR and methylKit Pipelines for DNA Methylation Detection

Munoa Hoyos, I.; Araolaza, M.; Calzado, I.; Albizuri, M.; Subiran, N.

2025-05-15 bioinformatics
10.1101/2025.05.11.653026 bioRxiv
Show abstract

Despite the improvements in tool development for DNA methylation analysis, there is a lack of a consensus on computational and statistical models used for differentially methylated cytosine (DMC) identification. This variability complicates the interpretation of findings and raises concerns about the reproducibility and biological significance of the detected results. In this regard, the primary objective of this study was to compare the performance, concordance, and biological relevance of edgeR and methylKit tools in detecting DMCs (the first one based on fold change and the second one based on percentage), following morphine exposure model in mouse embryonic stem cells (mESCs). While a different number of total DMCs was identified by each tool, both pipelines detected a global hypomethylation as a result. Genomic analysis revealed a predominant distribution of DMCs in intergenic and intronic regions on one hand, and in open sea regions on the other hand. Despite the differences in sensitivity, both tools demonstrated moderate concordance in DMCs detection ([~]56%) and high concordance in gene level analysis ([~]90%), identifying similar differentially methylated genes (DMGs). Overall, the results underscore the complementary strengths of methylKit and edgeR and highlight the importance of tool selection for epigenetic studies. As a conclusion, integrating both pipelines is recommended for comprehensive analysis, particularly in studies with complex experimental designs.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Epigenetics
43 papers in training set
Top 0.1%
23.8%
2
PLOS ONE
4510 papers in training set
Top 16%
11.0%
3
Clinical Epigenetics
53 papers in training set
Top 0.1%
8.9%
4
BMC Bioinformatics
383 papers in training set
Top 1%
6.7%
50% of probability mass above
5
Computational and Structural Biotechnology Journal
216 papers in training set
Top 1.0%
4.4%
6
Epigenetics & Chromatin
42 papers in training set
Top 0.1%
3.8%
7
Bioinformatics
1061 papers in training set
Top 5%
3.8%
8
Scientific Reports
3102 papers in training set
Top 40%
3.4%
9
Frontiers in Bioinformatics
45 papers in training set
Top 0.1%
2.0%
10
International Journal of Molecular Sciences
453 papers in training set
Top 7%
1.8%
11
PeerJ
261 papers in training set
Top 8%
1.6%
12
BMC Genomics
328 papers in training set
Top 3%
1.6%
13
Briefings in Bioinformatics
326 papers in training set
Top 4%
1.4%
14
Frontiers in Genetics
197 papers in training set
Top 6%
1.3%
15
Cancers
200 papers in training set
Top 4%
1.2%
16
Bioinformatics Advances
184 papers in training set
Top 4%
1.0%
17
Computers in Biology and Medicine
120 papers in training set
Top 4%
0.9%
18
Biochemistry and Biophysics Reports
28 papers in training set
Top 1%
0.8%
19
Genes
126 papers in training set
Top 3%
0.8%
20
Gene
41 papers in training set
Top 2%
0.7%
21
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 10%
0.7%
22
Genomics
60 papers in training set
Top 3%
0.7%
23
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 7%
0.5%
24
BMC Genomic Data
12 papers in training set
Top 0.3%
0.5%