Back

Short Interrupted Repeats Cassette (SIRC) ensembles of plant genomes reflects evolutionary route

Gorbenko, I. V.; Scherbakov, D. Y.; Zverintseva, K. M.; Konstantinov, Y. M.

2026-03-30 plant biology
10.64898/2026.03.27.714674 bioRxiv
Show abstract

Short Interrupted Repeats Cassettes (SIRC) are recently discovered eukaryotic DNA elements possessing many traits of satellite DNA and mobile genetic elements, and consisted of short direct repeats interspersed with diverse spacer sequences. The SIRC ensemble of individual species is highly heterogenous and cannot be studied using alignment methods. It was found that number of similar SIRC sequences in a given pair of species is in general correlated with their taxonomic distance, and, at the same time, closely related species can possess very diverged SIRC ensembles, which makes SIRC evolutionary pattern closer to mobile genetic element type. The SIRC sequences make up clusters with comparable sequence patterns, that are likely to demonstrate doublet evolutionary model which strongly supports that the SIRC structure is supported by the evolutionary selection. Several SIRC sequences of Arabidopsis were found to be of ancient origin with traceable evolution history as far as to the moss clade. We carried out unbiased detection of SIRC ensembles in 10 plant genomes and found that, despite very high intraspecies heterogeneity, SIRC sets possess strong interspecies phylogenetic signal. Key messageShort Interrupted Repeats Cassettes are elements of ancient origin, and could potentially be used to trace organism history, and to facilitate syntheny and Hi-C analysis.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
PLOS ONE
4510 papers in training set
Top 13%
14.4%
2
Gene
41 papers in training set
Top 0.1%
8.4%
3
Frontiers in Genetics
197 papers in training set
Top 0.6%
7.2%
4
Journal of Molecular Evolution
21 papers in training set
Top 0.1%
6.4%
5
BMC Genomics
328 papers in training set
Top 0.4%
4.9%
6
Scientific Reports
3102 papers in training set
Top 29%
4.2%
7
Genes
126 papers in training set
Top 0.3%
3.6%
8
The Plant Journal
197 papers in training set
Top 1%
3.6%
50% of probability mass above
9
Frontiers in Plant Science
240 papers in training set
Top 3%
3.1%
10
Genome Biology and Evolution
280 papers in training set
Top 0.7%
2.4%
11
PeerJ
261 papers in training set
Top 6%
1.9%
12
Microorganisms
101 papers in training set
Top 0.6%
1.8%
13
F1000Research
79 papers in training set
Top 1%
1.8%
14
Biology
43 papers in training set
Top 0.8%
1.7%
15
International Journal of Molecular Sciences
453 papers in training set
Top 8%
1.7%
16
Mobile DNA
27 papers in training set
Top 0.1%
1.7%
17
BMC Biology
248 papers in training set
Top 1%
1.7%
18
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 4%
1.7%
19
GigaScience
172 papers in training set
Top 2%
1.3%
20
Plant Physiology
217 papers in training set
Top 2%
1.1%
21
Genomics
60 papers in training set
Top 2%
0.9%
22
Computational and Structural Biotechnology Journal
216 papers in training set
Top 7%
0.9%
23
Heliyon
146 papers in training set
Top 5%
0.9%
24
Plants
39 papers in training set
Top 1%
0.9%
25
Journal of Structural Biology
58 papers in training set
Top 1%
0.8%
26
PLOS Genetics
756 papers in training set
Top 14%
0.8%
27
Journal of Virology
456 papers in training set
Top 3%
0.8%
28
Peer Community Journal
254 papers in training set
Top 4%
0.7%
29
BMC Genomic Data
12 papers in training set
Top 0.2%
0.7%
30
Journal of Genetics and Genomics
36 papers in training set
Top 2%
0.7%