Back

MetaGEAR Explorer: Rapid interactive searches and cross-cohort analyses of microbiome gene associations in disease

Rios, E.; Jin, S.; Zhang, C.; Neuhaus, F.; He, X.; Weissenberger, S.; Schirmer, M.

2026-03-31 bioinformatics
10.64898/2026.03.30.715271 bioRxiv
Show abstract

The human gut microbiome has been linked to inflammatory bowel disease (IBD) and colorectal cancer (CRC), yet identifying disease-associated microbial genes across diverse human cohort studies remains challenging due to inconsistent data processing and the high dimensionality of gene-level abundance profiles. Here we present MetaGEAR Explorer, a web platform comprising a user interface and web services for interactive and programmatic gene-centric exploration of >33 million microbial gene families across 9,053 metagenomic samples from 24 IBD, CRC, and healthy cohorts. MetaGEAR Explorer facilitates gene searches against a catalog of non-redundant gene families via nucleotide or amino acid sequence queries (BLAST) and Pfam domain-based searches. For matched gene families, the platform computes disease-stratified prevalence, cross-cohort disease associations, species-level taxonomic stratification, and functional domain annotations. Importantly, users can also explore the genomic context of individual gene families via contig-based co-localization networks derived from metagenomic species pangenome (MSP) assignments and pivot from sequence to domain searches to identify functional homologs. Additionally, the platform features a dedicated catalog to interactively browse 13,795 MSPs and export results programmatically via API endpoints. We demonstrate MetaGEAR Explorers utility using the narG-encoding nitrate reductase gene and a case study of colibactin self-protection genes (clbS and DUF1706 homologs), where the platform revealed a consistent shift from commensals to Gammaproteobacteria carriers in disease. In summary, MetaGEAR Explorer enables rapid cross-cohort functional meta-analyses and is freely available at https://metagear-explorer.schirmerlab.de. GRAPHICAL ABSTRACT O_FIG O_LINKSMALLFIG WIDTH=177 HEIGHT=200 SRC="FIGDIR/small/715271v1_ufig1.gif" ALT="Figure 1"> View larger version (37K): org.highwire.dtl.DTLVardef@ea318dorg.highwire.dtl.DTLVardef@15b497borg.highwire.dtl.DTLVardef@354abcorg.highwire.dtl.DTLVardef@bd7dc5_HPS_FORMAT_FIGEXP M_FIG C_FIG

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Microbiome
139 papers in training set
Top 0.1%
33.2%
2
Bioinformatics
1061 papers in training set
Top 3%
10.1%
3
Nucleic Acids Research
1128 papers in training set
Top 3%
6.8%
50% of probability mass above
4
Cell Reports Methods
141 papers in training set
Top 0.6%
4.4%
5
Computational and Structural Biotechnology Journal
216 papers in training set
Top 1%
4.2%
6
Nature Biotechnology
147 papers in training set
Top 2%
4.0%
7
Nature Communications
4913 papers in training set
Top 39%
3.6%
8
Briefings in Bioinformatics
326 papers in training set
Top 3%
2.6%
9
Bioinformatics Advances
184 papers in training set
Top 2%
2.4%
10
Genome Biology
555 papers in training set
Top 3%
2.1%
11
Advanced Science
249 papers in training set
Top 9%
1.9%
12
mSystems
361 papers in training set
Top 4%
1.9%
13
GigaScience
172 papers in training set
Top 1%
1.7%
14
Cell Systems
167 papers in training set
Top 8%
1.3%
15
Genome Medicine
154 papers in training set
Top 6%
1.2%
16
Science China Life Sciences
26 papers in training set
Top 2%
1.0%
17
PLOS Computational Biology
1633 papers in training set
Top 21%
1.0%
18
BMC Bioinformatics
383 papers in training set
Top 6%
1.0%
19
Microbial Genomics
204 papers in training set
Top 2%
0.8%
20
Patterns
70 papers in training set
Top 2%
0.8%
21
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.7%
22
Gut Microbes
70 papers in training set
Top 1%
0.7%
23
npj Systems Biology and Applications
99 papers in training set
Top 3%
0.6%
24
Methods in Ecology and Evolution
160 papers in training set
Top 3%
0.6%
25
mSphere
281 papers in training set
Top 7%
0.6%
26
Nature Methods
336 papers in training set
Top 7%
0.6%
27
eLife
5422 papers in training set
Top 61%
0.6%
28
Metabolites
50 papers in training set
Top 2%
0.5%
29
npj Biofilms and Microbiomes
56 papers in training set
Top 2%
0.5%