Back

Rapid evolution and comparative analysis of piRNA clusters in D.simulans

Narayanan, P.; Srivastav, S.; Signor, S.

2026-01-20 evolutionary biology
10.64898/2026.01.19.700409 bioRxiv
Show abstract

Eukaryotic genomes are ubiquitously occupied by mobile genetic elements termed transposons, which are silenced via a specialized class of small RNA called piRNA. The small RNA is produced from the transposons themselves when they occupy specialized regions of the genome termed piRNA clusters. The formation of these specialized regions, or their evolution over time, is not well understood. Recent work has suggested that they are extremely variable even within a single species such as Drosophila melanogaster. We were interested in taking a comparative approach to piRNA cluster evolution to ask the question - what processes are unique to D. melanogaster and which are shared? Shared phenomena are more likely to be fundamental aspects of piRNA formation and evolution compared to those that are more labile. Using five high-quality long-read genome assemblies and five genotype-specific piRNA libraries, we approach this question from a population genetics standpoint. We annotate piRNA clusters, transposons, and structural variants in each of these five genomes. We found extensive variation in piRNA clusters across strains, with smaller piRNA clusters more likely to be limited to a single genotype. By and large, our results are consistent with a model of piRNA cluster evolution in which piRNA clusters are rapidly formed and lost, with a small subset increasing in frequency and length over time. However, we find that the TEs which nucleate the formation of small piRNA clusters are entirely distinct in D. simulans compared to D. melanogaster, and likely reflect its invasion history rather than any inherent property of the transposon to nucleate clusters. Therefore, while large common clusters can act as traps as has been posited for piRNA clusters, there are also numerous small clusters that are born and lost rapidly within a species.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Genome Biology and Evolution
280 papers in training set
Top 0.1%
28.0%
2
Molecular Biology and Evolution
488 papers in training set
Top 0.3%
10.6%
3
Genome Research
409 papers in training set
Top 0.2%
8.5%
4
PLOS Genetics
756 papers in training set
Top 2%
8.3%
50% of probability mass above
5
Genetics
225 papers in training set
Top 0.6%
7.3%
6
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 0.7%
4.9%
7
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 19%
3.7%
8
Mobile DNA
27 papers in training set
Top 0.1%
3.6%
9
Current Biology
596 papers in training set
Top 8%
2.1%
10
G3 Genes|Genomes|Genetics
351 papers in training set
Top 1%
1.9%
11
Molecular Ecology
304 papers in training set
Top 2%
1.9%
12
Evolution
199 papers in training set
Top 1%
1.5%
13
GENETICS
189 papers in training set
Top 1%
0.9%
14
eLife
5422 papers in training set
Top 53%
0.9%
15
PLOS ONE
4510 papers in training set
Top 67%
0.8%
16
Proceedings of the Royal Society B: Biological Sciences
341 papers in training set
Top 6%
0.8%
17
PLOS Biology
408 papers in training set
Top 19%
0.8%
18
Genome Biology
555 papers in training set
Top 8%
0.7%
19
Nature Ecology & Evolution
113 papers in training set
Top 4%
0.7%
20
Nature Communications
4913 papers in training set
Top 64%
0.7%
21
Virus Evolution
140 papers in training set
Top 2%
0.7%
22
Evolution Letters
71 papers in training set
Top 2%
0.7%
23
PLOS Pathogens
721 papers in training set
Top 10%
0.7%
24
PLOS Computational Biology
1633 papers in training set
Top 29%
0.5%
25
BMC Biology
248 papers in training set
Top 7%
0.5%
26
Frontiers in Genetics
197 papers in training set
Top 12%
0.5%