Pangenome analysis of Clostridium scindens: a collection of diverse bile acid and steroid metabolizing commensal gut bacterial strains

Olivos-Caicedo, K. Y.; Fernandez, F.; Daniel, S. L.; Anantharaman, K.; Ridlon, J. M.; Alves, J. M. P.

2024-09-06 microbiology
10.1101/2024.09.06.610859 bioRxiv

Clostridium scindens is a commensal gut bacterium capable of forming the secondary bile acids deoxycholic acid and lithocholic acid from the primary bile acids cholic acid and chenodeoxycholic acid, respectively, as well as converting glucocorticoids to androgens. Historically, only two strains, C. scindens ATCC 35704 and C. scindens VPI 12708, have been characterized in vitro and in vivo to any significant extent. The formation of secondary bile acids is important in maintaining normal gastrointestinal function, in regulating the structure of the gut microbiome, in the etiology of diseases such as cancers of the GI tract, and in the prevention of Clostridium difficile infection. We therefore wanted to determine the pangenome of 34 cultured strains of C. scindens and a set of 200 metagenome-assembled genomes (MAGs) to understand the variability among strains. The results indicate that the 34 strains of C. scindens have an open pangenome with 12,720 orthologous gene groups, and a core genome with 1,630 gene families, in addition to 7,051 and 4,039 gene families in the accessory and unique (i.e., strain-exclusive) genomes, respectively. The core genome contains 39% of the proteins with predicted metabolic function, and, in the unique genome, the function of storage and processing of information prevails, with 34% of the proteins being in that category. The pangenome profile including the MAGs also proved to be open. The presence of bile acid-inducible (bai) and steroid-17,20-desmolase (des) genes was identified among groups of strains. The analysis reveals that C. scindens strains are distributed into two clades, indicating the possible onset of C. scindens separation into two species, confirmed by gene content, phylogenomic, and average nucleotide identity (ANI) analyses. This study provides insight into the structure and function of the C. scindens pangenome, offering a genetic foundation of significance for many aspects of research on the intestinal microbiota and bile acid metabolism.
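
The core, accessory, and unique counts reported in the abstract partition the pangenome, and they do sum to the stated 12,720 gene groups. The sketch below illustrates one common way such a partition is computed from a gene presence/absence table. It is only an illustration: the function name, the toy data, and the strict thresholds (core = present in every strain, unique = present in exactly one) are assumptions, and the authors' pipeline may use different tools or cutoffs.

```python
from collections import Counter


def partition_pangenome(presence):
    """Count core, accessory, and unique gene families.

    `presence` is a hypothetical input: a dict mapping each gene family
    name to the set of strains that carry it.
    Assumed definitions (not taken from the paper):
      core      = family present in every strain
      unique    = family present in exactly one strain
      accessory = everything in between
    """
    all_strains = set().union(*presence.values())
    n_strains = len(all_strains)
    counts = Counter()
    for carriers in presence.values():
        if len(carriers) == n_strains:
            counts["core"] += 1
        elif len(carriers) == 1:
            counts["unique"] += 1
        else:
            counts["accessory"] += 1
    return counts


# Toy example with three made-up strains and four made-up families.
toy = {
    "famA": {"s1", "s2", "s3"},
    "famB": {"s1", "s2"},
    "famC": {"s3"},
    "famD": {"s1", "s2", "s3"},
}
toy_counts = partition_pangenome(toy)

# Consistency check on the figures quoted in the abstract.
reported = {"core": 1630, "accessory": 7051, "unique": 4039}
reported_total = sum(reported.values())
```

Applied to the abstract's figures, the three categories add up to the 12,720 orthologous gene groups, which means the core genome is roughly 13% of the pangenome of the 34 cultured strains.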

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1. Frontiers in Microbiology; 375 papers in training set; Top 0.1%; 21.9%
2. mSystems; 361 papers in training set; Top 0.5%; 12.2%
3. BMC Microbiology; 35 papers in training set; Top 0.1%; 8.2%
4. Frontiers in Cellular and Infection Microbiology; 98 papers in training set; Top 0.5%; 6.2%
5. PLOS ONE; 4510 papers in training set; Top 35%; 4.2%

50% of probability mass above

6. Scientific Reports; 3102 papers in training set; Top 29%; 4.2%
7. mSphere; 281 papers in training set; Top 2%; 3.5%
8. Microorganisms; 101 papers in training set; Top 0.2%; 3.5%
9. Microbial Genomics; 204 papers in training set; Top 0.8%; 2.5%
10. Gut Microbes; 70 papers in training set; Top 0.5%; 1.8%
11. Microbiology Resource Announcements; 22 papers in training set; Top 0.3%; 1.8%
12. Environmental Microbiology; 119 papers in training set; Top 2%; 1.7%
13. mBio; 750 papers in training set; Top 8%; 1.6%
14. npj Biofilms and Microbiomes; 56 papers in training set; Top 1%; 1.6%
15. Current Microbiology; 18 papers in training set; Top 0.2%; 1.4%
16. Microbial Ecology; 28 papers in training set; Top 0.2%; 1.2%
17. Applied and Environmental Microbiology; 301 papers in training set; Top 2%; 1.2%
18. Genomics, Proteomics & Bioinformatics; 171 papers in training set; Top 5%; 0.9%
19. Microbiome; 139 papers in training set; Top 3%; 0.9%
20. PeerJ; 261 papers in training set; Top 13%; 0.9%
21. Microbiology Spectrum; 435 papers in training set; Top 5%; 0.8%
22. Archives of Microbiology; 11 papers in training set; Top 0.4%; 0.8%
23. Communications Biology; 886 papers in training set; Top 27%; 0.7%
24. Microbial Biotechnology; 29 papers in training set; Top 1%; 0.6%
25. ISME Communications; 103 papers in training set; Top 2%; 0.6%
26. The Journal of Nutrition; 21 papers in training set; Top 0.7%; 0.6%
27. Frontiers in Immunology; 586 papers in training set; Top 9%; 0.6%