Back

Mixture Margin Random-effects Copula Models for Inferring Temporally Conserved Microbial Co-variation Networks from Longitudinal Data

Deek, R. A.; Li, H.

2022-04-26 bioinformatics
10.1101/2022.04.25.489333 bioRxiv
Show abstract

Longitudinal microbiome studies, in which data on a single subject is collected repeatedly over time, are becoming increasingly common in biomedical research. Such studies provide an opportunity to study the inherently dynamic nature of a microbiome in a way that cannot be done using cross-sectional studies. In this paper, we develop random-effects copula models with mixed zero-beta margins to identify biologically meaningful temporally conserved co-variation between two bacterial taxa, while accounting for the excessive zeros seen in 16S rRNA and metagenomic sequencing data. The model assumes a random-effects model for the dependence parameter in the copulas, which captures the conserved microbial co-variation while allowing for a time-specific dependence parameters. We develop a Monte Carlo EM algorithm for efficient estimation of model parameters and a corresponding Monte Carlo likelihood ratio test for the mean dependence parameter. Simulation studies show that our test controls the Type I error rate and provides an unbiased estimate of the mean dependence parameter. Additionally, we apply our method to a longitudinal pediatric cohort and identify changes in both local and global patterns of microbial co-variation networks in infants treated with antibiotics. Our analysis shows that the no antibiotics network is less dependent on individual taxon, thus making it more stable than the antibiotics network and more robust to both targeted and random attacks. Author summaryIdentification of co-variation between two microbes in microbial communities provides important insights into the community structure and stability. The commonly used measures of co-variation do not handle excessive zeros observed in the data and cannot be applied to longitudinal microbiome data directly. In this paper, we develop random-effects copula models with mixed zero-beta margins to identify biologically meaningful temporally conserved co-variation between two bacterial taxa, while accounting for the excessive zeros seen in 16S rRNA and metagenomic sequencing data. The model captures the conserved microbial co-variations while allowing for a time-specific dependence parameters. We develop an efficient Monte Carlo-based algorithm for parameter estimation and statistical inference. We analyze the data from a pediatric longitudinal cohort and identify changes in both local and global patterns of microbial co-variation networks in infants treated with antibiotics.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 0.8%
26.4%
2
PLOS Computational Biology
1633 papers in training set
Top 2%
13.0%
3
mSystems
361 papers in training set
Top 1%
7.3%
4
PLOS ONE
4510 papers in training set
Top 31%
5.0%
50% of probability mass above
5
Biometrics
22 papers in training set
Top 0.1%
5.0%
6
BMC Bioinformatics
383 papers in training set
Top 3%
3.7%
7
Statistics in Medicine
34 papers in training set
Top 0.1%
3.1%
8
mSphere
281 papers in training set
Top 2%
2.8%
9
Microbial Genomics
204 papers in training set
Top 0.9%
2.1%
10
Frontiers in Genetics
197 papers in training set
Top 4%
1.9%
11
PLOS Genetics
756 papers in training set
Top 7%
1.9%
12
Scientific Reports
3102 papers in training set
Top 57%
1.7%
13
Biostatistics
21 papers in training set
Top 0.1%
1.7%
14
Methods in Ecology and Evolution
160 papers in training set
Top 1%
1.5%
15
Microbiome
139 papers in training set
Top 2%
1.4%
16
PeerJ
261 papers in training set
Top 9%
1.4%
17
BMC Genomics
328 papers in training set
Top 3%
1.3%
18
Briefings in Bioinformatics
326 papers in training set
Top 6%
0.9%
19
Bioinformatics Advances
184 papers in training set
Top 4%
0.8%
20
Journal of Computational Biology
37 papers in training set
Top 0.6%
0.8%
21
Frontiers in Microbiology
375 papers in training set
Top 9%
0.7%
22
Cell Reports Methods
141 papers in training set
Top 5%
0.7%
23
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 47%
0.7%
24
Journal of Theoretical Biology
144 papers in training set
Top 2%
0.5%
25
Computational and Structural Biotechnology Journal
216 papers in training set
Top 12%
0.5%