Back

Landscape of retron diversity across the SPIRE prokaryotic metagenome resource reveals candidate novel type XI-like lineages

Toro, N.

2026-05-14 microbiology
10.64898/2026.05.14.725207 bioRxiv
Show abstract

Retrons are prokaryotic genetic elements encoding a specialized reverse transcriptase (RT) that synthesizes multicopy single-stranded DNA (msDNA) and are increasingly recognized as components of bacterial anti-phage defense systems. However, their diversity and ecological distribution across large-scale genomic resources remain poorly characterized. Here, we surveyed retron RTs across the SPIRE representative metagenome collection, a non-redundant, species-level dataset spanning diverse microbial habitats. Using type-specific hidden Markov models cross-calibrated against all known retron groups, we identified retrons representing all canonical types together with additional divergent lineages. Retron distribution was highly structured across both taxonomy and ecology, with some lineages restricted to specific bacterial phyla and others broadly distributed across habitats. Host-associated environments accounted for the largest fraction of retron candidates, although several retron types showed clear specialization for aquatic or anthropogenic niches. Systematic novelty assessment identified two candidate type XI-like lineages, TXI_C2like and TXI_noncan_h, characterized by protease-independent architectures and lineage-specific accessory modules. De novo computational analyses further identified candidate msr/msd-like non-coding RNA elements upstream of the RT genes in both lineages, indicating that these highly divergent systems retain the defining RT-ncRNA architecture of canonical retrons. Together, these findings expand the known diversity of retron systems and reveal a strong interplay between evolutionary history and ecological specialization.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 12%
14.1%
2
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 7%
9.7%
3
Cell Host & Microbe
113 papers in training set
Top 0.6%
8.1%
4
Cell
370 papers in training set
Top 2%
6.9%
5
Nature Microbiology
133 papers in training set
Top 0.5%
6.1%
6
Microbiome
139 papers in training set
Top 0.6%
6.1%
50% of probability mass above
7
Cell Reports
1338 papers in training set
Top 12%
4.1%
8
Nature
575 papers in training set
Top 7%
3.8%
9
Nucleic Acids Research
1128 papers in training set
Top 6%
3.5%
10
Nature Ecology & Evolution
113 papers in training set
Top 2%
3.4%
11
Molecular Cell
308 papers in training set
Top 5%
3.4%
12
mBio
750 papers in training set
Top 5%
3.4%
13
Science
429 papers in training set
Top 10%
3.0%
14
PLOS Biology
408 papers in training set
Top 5%
2.6%
15
eLife
5422 papers in training set
Top 37%
2.0%
16
mSystems
361 papers in training set
Top 4%
1.8%
17
ISME Communications
103 papers in training set
Top 1%
1.6%
18
The ISME Journal
194 papers in training set
Top 2%
1.4%
19
Nature Biotechnology
147 papers in training set
Top 6%
1.2%
20
Molecular Biology and Evolution
488 papers in training set
Top 3%
1.2%
21
Science Advances
1098 papers in training set
Top 28%
0.9%
22
Genome Biology
555 papers in training set
Top 7%
0.9%
23
Genome Medicine
154 papers in training set
Top 9%
0.7%
24
Microbial Genomics
204 papers in training set
Top 3%
0.6%
25
PLOS Genetics
756 papers in training set
Top 17%
0.6%