Back

Annotation of piRNA source loci in the genome of non-model insects

Halbach, R.; van Rij, R. P.

2024-08-15 molecular biology
10.1101/2024.08.15.608080 bioRxiv
Show abstract

The PIWI-interacting RNA (piRNA) pathway plays a crucial role in the defense of metazoan genomes against parasitic transposable elements. The major source of piRNAs in the model organism Drosophila melanogaster are defective transposon copies located in piRNA clusters - genomic regions with a high piRNA density that are thought to serve as an immunological memory of past invasion by those elements. Different approaches have been used to annotate piRNA clusters in model organisms like flies, mice and rats, and software such as proTRAC or piClust are available for piRNA cluster annotation. However, these software often make assumptions based on current knowledge of piRNA clusters from (mostly vertebrate) model organisms, which do not necessarily hold true for non-model insects in which the piRNA pathway is less understood. Here we describe a simple piRNA cluster annotation approach that utilizes very little assumptions on the biology of the piRNA pathway. The pipeline has been validated on mosquito genomes but can be easily used for other non-model insect species as well.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Mobile DNA
27 papers in training set
Top 0.1%
40.2%
2
BMC Genomics
328 papers in training set
Top 0.4%
4.9%
3
Frontiers in Genetics
197 papers in training set
Top 1%
4.4%
4
Nucleic Acids Research
1128 papers in training set
Top 5%
4.0%
50% of probability mass above
5
PLOS ONE
4510 papers in training set
Top 38%
3.7%
6
Scientific Reports
3102 papers in training set
Top 40%
3.3%
7
Genome Biology and Evolution
280 papers in training set
Top 0.6%
2.8%
8
PLOS Computational Biology
1633 papers in training set
Top 14%
1.9%
9
PeerJ
261 papers in training set
Top 5%
1.9%
10
Genomics
60 papers in training set
Top 0.8%
1.8%
11
G3 Genes|Genomes|Genetics
351 papers in training set
Top 1%
1.7%
12
BMC Bioinformatics
383 papers in training set
Top 4%
1.7%
13
Computational and Structural Biotechnology Journal
216 papers in training set
Top 6%
1.2%
14
NAR Genomics and Bioinformatics
214 papers in training set
Top 3%
1.1%
15
PLOS Genetics
756 papers in training set
Top 12%
1.1%
16
Viruses
318 papers in training set
Top 4%
1.0%
17
BMC Biology
248 papers in training set
Top 3%
0.9%
18
Peer Community Journal
254 papers in training set
Top 3%
0.9%
19
Biology Open
130 papers in training set
Top 2%
0.8%
20
Microorganisms
101 papers in training set
Top 2%
0.8%
21
Bioinformatics
1061 papers in training set
Top 9%
0.8%
22
Journal of Molecular Evolution
21 papers in training set
Top 0.4%
0.8%
23
Genes
126 papers in training set
Top 3%
0.8%
24
Journal of Virology
456 papers in training set
Top 3%
0.8%
25
Wellcome Open Research
57 papers in training set
Top 2%
0.7%
26
iScience
1063 papers in training set
Top 33%
0.7%
27
Biology
43 papers in training set
Top 3%
0.7%
28
Insect Molecular Biology
19 papers in training set
Top 0.2%
0.7%
29
International Journal of Molecular Sciences
453 papers in training set
Top 19%
0.5%
30
Genome Biology
555 papers in training set
Top 9%
0.5%