Back

Impact of experimental bias on compositional analysis of microbiome data

Hu, Y.; Satten, G. A.; Hu, Y.

2023-02-13 bioinformatics
10.1101/2023.02.08.527766 bioRxiv
Show abstract

Microbiome data are subject to experimental bias that is caused by DNA extraction, PCR amplification among other sources, but this important feature is often ignored when developing statistical methods for analyzing microbiome data. McLaren, Willis and Callahan (2019) proposed a model for how such bias affects the observed taxonomic profiles, which assumes main effects of bias without taxon-taxon interactions. Our newly developed method, LOCOM (logistic regression for compositional analysis) for testing differential abundance of taxa, is the first method that accounted for experimental bias and is robust to the main effect biases. However, there is also evidence for taxon-taxon interactions. In this report, we formulated a model for interaction biases and used simulations based on this model to evaluate the impact of interaction biases on the performance of LOCOM as well as other available compositional analysis methods. Our simulation results indicated that LOCOM remained robust to a reasonable range of interaction biases. The other methods tended to have inflated FDR even when there were only main effect biases. LOCOM maintained the highest sensitivity even when the other methods cannot control the FDR. We thus conclude that LOCOM outperforms the other methods for compositional analysis of microbiome data considered here.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 1.0%
23.2%
2
PeerJ
261 papers in training set
Top 0.1%
19.3%
3
BMC Bioinformatics
383 papers in training set
Top 1%
8.7%
50% of probability mass above
4
Methods in Ecology and Evolution
160 papers in training set
Top 0.7%
4.1%
5
PLOS ONE
4510 papers in training set
Top 37%
3.7%
6
Frontiers in Genetics
197 papers in training set
Top 3%
2.1%
7
PLOS Computational Biology
1633 papers in training set
Top 13%
2.1%
8
Microbiome
139 papers in training set
Top 2%
1.9%
9
Frontiers in Microbiology
375 papers in training set
Top 5%
1.8%
10
Bioinformatics Advances
184 papers in training set
Top 3%
1.8%
11
Scientific Reports
3102 papers in training set
Top 56%
1.7%
12
Ecological Informatics
29 papers in training set
Top 0.4%
1.7%
13
BMC Genomics
328 papers in training set
Top 3%
1.5%
14
Ecology and Evolution
232 papers in training set
Top 3%
1.5%
15
Briefings in Bioinformatics
326 papers in training set
Top 4%
1.4%
16
mSphere
281 papers in training set
Top 4%
1.4%
17
Journal of Computational Biology
37 papers in training set
Top 0.3%
1.3%
18
Statistics in Medicine
34 papers in training set
Top 0.2%
1.3%
19
F1000Research
79 papers in training set
Top 2%
1.3%
20
mSystems
361 papers in training set
Top 6%
1.0%
21
Frontiers in Bioinformatics
45 papers in training set
Top 0.8%
0.8%
22
Molecular Ecology Resources
161 papers in training set
Top 1%
0.8%
23
Environmental DNA
49 papers in training set
Top 0.3%
0.8%
24
Peer Community Journal
254 papers in training set
Top 4%
0.7%
25
Microorganisms
101 papers in training set
Top 3%
0.5%