Back

Evolutionary trajectories of secondary replicons in multipartite genomes

Dranenko, N. O.; Rodina, A. D.; Demenchuk, Y. V.; Gelfand, M. S.; Bochkareva, O. O.

2023-04-09 genomics
10.1101/2023.04.09.536151 bioRxiv
Show abstract

Most bacterial genomes have a single chromosome that may be supplemented by smaller, dispensable plasmids. However, approximately 10% of bacteria with completely sequenced genomes, mostly pathogens and plant symbionts, have more than one stable large replicon. Some secondary replicons are species-specific, carrying pathogenicity or symbiotic factors. Other replicons are common on at least the genus level, carry house-keeping genes, and may have a size of several million base pairs. We analyzed the abundance and sizes of large secondary replicons in different groups of bacteria and identified two patterns in the evolution of multipartite genomes. In nine genera of four families, Pseudoalteromonadaceae, Burkholderiaceae, Vibrionaceae, and Brucellaceae, we observed a positive correlation between the sizes of the chromosome and the secondary replicon with the slope in the range of 0.6-1.2. This indicates that in these genera the replicons evolve in a coordinated manner, with comparable rates of gene gain/loss, hence supporting classification of such secondary replicons as chromids. The second, more common pattern, features gene gains and losses mainly occurring in the primary replicon, yielding a stable size of the secondary replicon. Such secondary replicons are usually present in only a low fraction of the genus species. Hence, such replicons behave as megaplasmids. A mixed situation was observed in symbiotic genera from the Rhizobiaceae family where the large secondary replicons are of stable size, but present in all species. These results may provide a general framework for understanding the evolution of genome complexity in prokaryotes. SignificanceLarge secondary replicons are observed in representatives of many taxonomic groups of bacteria. Traditionally, they are referred to as second chromosomes, chromids, or megaplasmids, with little consistency, in particular because their evolution remains understudied. Here we demonstrate that the sizes of secondary replicons follow two main evolutionary trends: replicons whose size scales linearly with the size of the main chromosome (the suggested term chromids) typically contain numerous essential genes (rRNA, tRNA, ribosomal proteins), while large secondary replicons of stable size (termed megaplasmids) contain fewer or none such genes.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Genome Biology and Evolution
280 papers in training set
Top 0.1%
32.0%
2
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 0.3%
8.1%
3
Nucleic Acids Research
1128 papers in training set
Top 3%
6.6%
4
Molecular Biology and Evolution
488 papers in training set
Top 0.8%
6.1%
50% of probability mass above
5
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 15%
4.7%
6
mBio
750 papers in training set
Top 4%
4.0%
7
Microbial Genomics
204 papers in training set
Top 0.6%
3.8%
8
PLOS Genetics
756 papers in training set
Top 5%
3.5%
9
mSystems
361 papers in training set
Top 4%
2.4%
10
PLOS Biology
408 papers in training set
Top 8%
1.8%
11
BMC Biology
248 papers in training set
Top 1%
1.6%
12
Scientific Reports
3102 papers in training set
Top 60%
1.6%
13
eLife
5422 papers in training set
Top 44%
1.6%
14
Nature Communications
4913 papers in training set
Top 54%
1.4%
15
PLOS Computational Biology
1633 papers in training set
Top 19%
1.3%
16
Genome Biology
555 papers in training set
Top 5%
1.3%
17
Journal of Molecular Evolution
21 papers in training set
Top 0.2%
1.2%
18
BMC Genomics
328 papers in training set
Top 5%
0.9%
19
Molecular Ecology
304 papers in training set
Top 4%
0.7%
20
Communications Biology
886 papers in training set
Top 26%
0.7%
21
PLOS ONE
4510 papers in training set
Top 70%
0.7%
22
Frontiers in Microbiology
375 papers in training set
Top 10%
0.7%
23
Frontiers in Genetics
197 papers in training set
Top 11%
0.7%
24
Genome Research
409 papers in training set
Top 5%
0.6%
25
New Phytologist
309 papers in training set
Top 5%
0.6%