Back

A programmable seeker RNA guides target selection by IS1111 and IS110 type insertion sequences.

Siddiquee, R.; Pong, C. H.; Hall, R. M.; Ataide, S. F.

2024-04-27 microbiology
10.1101/2024.04.26.591405 bioRxiv
Show abstract

IS1111 and IS110 insertion sequence (IS) family members encode an unusual DEDD transposase type and exhibit specific target site selection. The IS1111 group include identifiable subterminal inverted repeats (sTIR) not found in the IS110 type [1]. IS in both families include a noncoding region (NCR) of significant length and, as each individual IS or group of closely related IS selects a different site, we had previously proposed that an NCR-derived RNA was involved in target selection [2]. Here, we found that the NCR is usually downstream of the transposase gene in IS1111 family IS and upstream in the IS110 type. Four IS1111 and one IS110 family members that target different sequences were used to demonstrate that the NCR determines a short seeker RNA (seekRNA) that co-purified with the transposase. The seekRNA was essential for transposition of the IS or a cargo flanked by IS ends from and to the preferred target. Short sequences matching both top and bottom strands of the target were identified in the seekRNA but their order in IS1111 and IS110 family IS was reversed. Reprogramming the seekRNA and donor flank to target a different site was demonstrated, indicating future biotechnological potential for these systems.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Mobile DNA
27 papers in training set
Top 0.1%
22.7%
2
PLOS ONE
4510 papers in training set
Top 27%
6.4%
3
Biochemical and Biophysical Research Communications
78 papers in training set
Top 0.1%
6.4%
4
Genes
126 papers in training set
Top 0.1%
4.9%
5
Scientific Reports
3102 papers in training set
Top 36%
3.6%
6
Gigabyte
60 papers in training set
Top 0.4%
2.9%
7
BMC Genomics
328 papers in training set
Top 2%
2.1%
8
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 2%
2.1%
50% of probability mass above
9
Biology Letters
66 papers in training set
Top 0.1%
2.1%
10
PLOS Genetics
756 papers in training set
Top 7%
1.9%
11
Nucleic Acids Research
1128 papers in training set
Top 10%
1.7%
12
Genome Biology and Evolution
280 papers in training set
Top 1.0%
1.7%
13
F1000Research
79 papers in training set
Top 2%
1.7%
14
International Journal of Molecular Sciences
453 papers in training set
Top 9%
1.5%
15
BMC Microbiology
35 papers in training set
Top 0.8%
1.3%
16
ACS Synthetic Biology
256 papers in training set
Top 2%
1.3%
17
Computational and Structural Biotechnology Journal
216 papers in training set
Top 6%
1.2%
18
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 4%
1.2%
19
Viruses
318 papers in training set
Top 4%
1.1%
20
Frontiers in Microbiology
375 papers in training set
Top 7%
1.0%
21
Gene
41 papers in training set
Top 2%
0.8%
22
Journal of Biological Chemistry
641 papers in training set
Top 4%
0.8%
23
Open Biology
95 papers in training set
Top 2%
0.8%
24
FEBS Letters
42 papers in training set
Top 0.3%
0.8%
25
Microbial Genomics
204 papers in training set
Top 2%
0.8%
26
PeerJ
261 papers in training set
Top 15%
0.8%
27
mBio
750 papers in training set
Top 11%
0.8%
28
Microbiology
57 papers in training set
Top 1%
0.8%
29
Access Microbiology
22 papers in training set
Top 0.7%
0.7%
30
Current Microbiology
18 papers in training set
Top 0.7%
0.7%