Back

Gene loss and acquisition in lineages of bacteria evolving in a human host environment

Gabrielaite, M.; Johansen, H. K.; Molin, S.; Nielsen, F. C.; Marvig, R. L.

2020-02-03 evolutionary biology
10.1101/2020.02.03.931667 bioRxiv
Show abstract

While genome analyses have documented that there are differences in the gene repertoire between evolutionary distant lineages of the same bacterial species, less is known about micro-evolutionary dynamics of gene loss and acquisition within lineages of bacteria as they evolve over the timescale of years. This knowledge is valuable to understand both the basic mutational steps that on long timescales lead to evolutionary distant bacterial lineages, and the evolution of the individual lineages themselves. In the case that lineages evolve in a human host environment, gene loss and acquisition may furthermore have implication for disease. We analyzed the genomes of 45 Pseudomonas aeruginosa lineages evolving in the lungs of cystic fibrosis patients to identify genes that are lost or acquired during the first years of infection in each of the different lineages. On average, the lineage genome content changed with 88 genes (range 0-473). Genes were more often lost than acquired, and prophage genes were more variable than bacterial genes. We identified genes that were lost or acquired independently across different clonal lineages, i.e. convergent molecular evolution. Convergent evolution suggests that there is a selection for loss and acquisition of certain genes in the host environment. We find that a significant proportion of such genes are associated with virulence; a trait previously shown to be important for adaptation. Furthermore, we also compared the genomes across lineages to show that within-lineage variable genes more often belonged to genomic content not shared across all lineages. Finally, we used 4,760 genes shared by 446 P. aeruginosa genomes to develop a stable and discriminatory typing scheme for P. aeruginosa clone types (Pactyper, https://github.com/MigleSur/Pactyper). In sum, our analysis adds to the knowledge on the pace and drivers of gene loss and acquisition in bacteria evolving over multiple years in a human host environment and provides a basis to further understand how gene loss and acquisition plays a role in lineage differentiation and host adaptation. Data SummaryP. aeruginosa genome sequencing data has been made publicly available by Marvig et al. (2015) and is deposited in Sequence Read Archive (SRA) under accession ERP004853.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Genome Biology and Evolution
280 papers in training set
Top 0.1%
37.3%
2
Molecular Biology and Evolution
488 papers in training set
Top 0.3%
12.5%
3
Microbial Genomics
204 papers in training set
Top 0.3%
8.1%
50% of probability mass above
4
Evolution Letters
71 papers in training set
Top 0.6%
3.9%
5
BMC Genomics
328 papers in training set
Top 0.9%
3.5%
6
mBio
750 papers in training set
Top 5%
3.0%
7
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 30%
1.9%
8
eLife
5422 papers in training set
Top 40%
1.8%
9
mSphere
281 papers in training set
Top 3%
1.7%
10
Journal of Molecular Evolution
21 papers in training set
Top 0.2%
1.5%
11
Molecular Ecology
304 papers in training set
Top 3%
1.2%
12
Genome Research
409 papers in training set
Top 3%
1.2%
13
Proceedings of the Royal Society B: Biological Sciences
341 papers in training set
Top 5%
1.2%
14
Genetics
225 papers in training set
Top 3%
1.1%
15
PLOS Genetics
756 papers in training set
Top 12%
1.1%
16
Scientific Reports
3102 papers in training set
Top 68%
1.1%
17
PLOS Biology
408 papers in training set
Top 16%
0.9%
18
Nature Communications
4913 papers in training set
Top 60%
0.9%
19
mSystems
361 papers in training set
Top 6%
0.9%
20
Cell Host & Microbe
113 papers in training set
Top 5%
0.8%
21
Philosophical Transactions of the Royal Society B: Biological Sciences
53 papers in training set
Top 1%
0.7%
22
Evolutionary Applications
91 papers in training set
Top 1%
0.7%
23
Life Science Alliance
263 papers in training set
Top 2%
0.6%
24
The American Naturalist
114 papers in training set
Top 2%
0.6%