LOCOM2: Robust Differential Abundance Analysis for Microbiome Data
He, M.; Satten, G. A.; Hu, Y.-J.
Show abstract
BackgroundNumerous methods have been developed for differential abundance analysis of microbiome data; however, many fail to adequately control error rates, contributing to the reproducibility crisis in microbiome research. Moreover, new challenges have emerged, including large-scale studies, differential library size distributions, unbalanced case-control designs, and the increasing availability of only relative-abundance data rather than read counts. MethodsWe propose LOCOM2 to address these challenges. The method refines the weighting scheme in LOCOM to eliminate confounding by library size while accommodating relative abundance data. It incorporates a series of adjustments to ensure stable and reliable estimation, even under extreme conditions such as very rare taxa and highly unbalanced case-control designs. In addition, LOCOM2 replaces the computationally intensive permutation procedure in LOCOM with a Wald-type test, substantially improving computational efficiency. To evaluate performance, we conducted extensive simulation studies using the MIDASim simulator and three data templates representing diverse body sites. We benchmarked LOCOM2 against state-of-the-art methods, including LOCOM, LinDA, ANCOM-BC2, MaAsLin2, and MaAsLin3. This benchmarking effort provides an essential foundation for the next stage of microbiome research. ResultsLOCOM2 achieved accurate control of the false discovery rate across all simulation scenarios, whereas none of the other methods consistently did so. LOCOM2 also demonstrated the highest sensitivity for detecting true signals. Applications of these methods to three real microbiome datasets further corroborated these findings.
Matching journals
The top 3 journals account for 50% of the predicted probability mass.