Back

Integrated analysis reveals strong reproducible signals within and across studies of the built environment

Flemister, A. B.; Blakley, I. C.; Fodor, A. A.

2026-04-03 microbiology
10.64898/2026.04.03.716326 bioRxiv
Show abstract

BackgroundBuilt environment microbiome studies have identified numerous factors that shape indoor microbiomes, yet the reproducibility of these findings across buildings, timepoints, and research groups remains unclear. Differences in sequencing protocols, sampling design, and environments pose major challenges for cross-study comparisons, particularly in low-biomass environments where technical variation can obscure biological signal. To address this gap, we constructed a simple ontology which groups samples into one of three categories: hand, hand-associated surfaces, and floor then applied it to four publicly available 16S rRNA gene datasets: a hospital, university dormitory, Air Force dormitory, and private residential houses. ResultsWe identified strong and reproducible separation between floors and surfaces with frequent human contact. We found that floors consistently harbored soil-associated taxa, including KD4-96, 67-14, Skermanella, and Sphingobacterium, whereas hands and hand-associated surfaces were enriched with skin-associated genera, such as Lawsonella and Cutibacterium. Within studies, these results were generally consistent across timepoints. Across studies, mixed-model PERMANOVA analysis revealed significant clustering by sample type, with modest effects of study, suggesting that biological signal outweighed differences in laboratory or sequencing methods. Leave-one-study-out random forest models achieved high AUCs for hand vs. floor comparisons (0.865 to 0.921), moderate AUCs for hand-associated vs. floor comparisons, and weaker performance for hand vs. hand-associated comparisons. Application of the batch-correction method DEBIAS-M did not improve effect sizes or classification performance, indicating that reproducible structure was already discernible without batch adjustment. ConclusionsDespite substantial temporal and environmental heterogeneity among studies, we found that the built environment microbiome has a reproducible bacterial signal. There was consistent enrichment of soil-derived taxa on floors and human-associated taxa on hands and hand-associated surfaces suggesting a stable microbiome despite differences in building type, occupancy, and methodology. These findings establish an important foundation for future studies, suggesting cross-study comparability, the accuracy of ecological inference, and the ability to support the development of predictive applications in indoor microbiome research.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
mSystems
361 papers in training set
Top 0.3%
14.6%
2
PLOS ONE
4510 papers in training set
Top 19%
10.0%
3
mSphere
281 papers in training set
Top 0.5%
6.8%
4
Microbiome
139 papers in training set
Top 0.5%
6.3%
5
Scientific Reports
3102 papers in training set
Top 19%
6.3%
6
Microbial Genomics
204 papers in training set
Top 0.5%
4.3%
7
Nature Communications
4913 papers in training set
Top 37%
3.9%
50% of probability mass above
8
Environmental Microbiome
26 papers in training set
Top 0.1%
3.6%
9
Applied and Environmental Microbiology
301 papers in training set
Top 1%
2.6%
10
Microbiology Spectrum
435 papers in training set
Top 1%
2.4%
11
npj Biofilms and Microbiomes
56 papers in training set
Top 0.7%
2.3%
12
Frontiers in Microbiology
375 papers in training set
Top 5%
1.9%
13
mBio
750 papers in training set
Top 7%
1.9%
14
FEMS Microbes
14 papers in training set
Top 0.1%
1.8%
15
Microbial Ecology
28 papers in training set
Top 0.1%
1.6%
16
ISME Communications
103 papers in training set
Top 1%
1.6%
17
Environmental Science & Technology
64 papers in training set
Top 2%
1.3%
18
Nature Microbiology
133 papers in training set
Top 3%
1.2%
19
The ISME Journal
194 papers in training set
Top 2%
1.2%
20
Methods in Ecology and Evolution
160 papers in training set
Top 2%
1.2%
21
PLOS Computational Biology
1633 papers in training set
Top 20%
1.1%
22
Environmental Microbiology
119 papers in training set
Top 2%
0.9%
23
FEMS Microbiology Ecology
47 papers in training set
Top 0.4%
0.9%
24
Environmental Microbiology Reports
27 papers in training set
Top 0.8%
0.7%
25
PeerJ
261 papers in training set
Top 16%
0.7%
26
International Journal of Environmental Research and Public Health
124 papers in training set
Top 8%
0.6%
27
Pediatric Research
18 papers in training set
Top 0.5%
0.6%
28
Cell
370 papers in training set
Top 19%
0.6%
29
Cell Reports Methods
141 papers in training set
Top 6%
0.6%
30
Environmental Science & Technology Letters
22 papers in training set
Top 0.5%
0.6%