Back

Exploring Alu-Driven DNA Transductions in the Primate Genomes

Halabian, R.; Storer, J. M.; Hoyt, S. J.; Hartley, G. A.; Brosius, J.; O'Neill, R. J.; Makalowski, W.

2024-04-30 genetics
10.1101/2024.04.29.591526 bioRxiv
Show abstract

Long terminal repeats (LTRs) and non-LTRs retrotransposons, aka retroelements, collectively occupy a substantial part of the human genome. Certain non-LTR retroelements, such as L1 and SVA, have the potential for DNA transduction, which involves the concurrent mobilization of flanking non-transposon DNA during retrotransposition. These events can be detected by computational approaches. Despite being the most abundant short interspersed sequences (SINEs) that are still active within the genomes of humans and other primates, the transduction rate caused by Alu sequences remains unexplored. Therefore, we conducted an analysis to address this research gap and utilized an in-house program to probe for the presence of Alu-related transductions in the human genome. We analyzed 118,489 full-length AluY subfamilies annotated within the first complete human reference genome, T2T-CHM13. For comparative insights, we extended our exploration to two non-human primate genomes, the chimpanzee and the rhesus monkey. After manual curation, our findings did not confirm any Alu-mediated transductions, whose source genes are, unlike L1 or SVA, transcribed by RNA polymerase III, implying that they are infrequent or possibly absent not only in the human but also in chimpanzee and rhesus monkey genomes. Although we identified loci in which the 3 Target Site Duplication (TSD) was located distantly from the retrotransposed AluYs, a transduction hallmark, our study could not find further support for such events. The observation of these instances can be explained by the incorporation of other nucleotides into the poly(A) tails in conjunction with polymerase slippage.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Mobile DNA
27 papers in training set
Top 0.1%
28.6%
2
Frontiers in Genetics
197 papers in training set
Top 0.1%
19.3%
3
eLife
5422 papers in training set
Top 16%
5.0%
50% of probability mass above
4
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 1%
5.0%
5
Journal of Virology
456 papers in training set
Top 1%
3.7%
6
Scientific Reports
3102 papers in training set
Top 42%
3.0%
7
PLOS Genetics
756 papers in training set
Top 6%
2.5%
8
BMC Genomics
328 papers in training set
Top 2%
2.1%
9
Nature Communications
4913 papers in training set
Top 46%
2.1%
10
Journal of Genetics and Genomics
36 papers in training set
Top 0.8%
1.8%
11
Nucleic Acids Research
1128 papers in training set
Top 10%
1.7%
12
Molecular Biology and Evolution
488 papers in training set
Top 2%
1.7%
13
Genome Biology and Evolution
280 papers in training set
Top 1%
1.5%
14
PLOS ONE
4510 papers in training set
Top 56%
1.5%
15
Genes
126 papers in training set
Top 2%
0.9%
16
Genome Biology
555 papers in training set
Top 7%
0.8%
17
Genetics
225 papers in training set
Top 4%
0.8%
18
Cell Discovery
54 papers in training set
Top 4%
0.8%
19
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 9%
0.8%
20
iScience
1063 papers in training set
Top 30%
0.8%
21
Communications Biology
886 papers in training set
Top 22%
0.8%
22
Briefings in Bioinformatics
326 papers in training set
Top 6%
0.8%
23
Bioinformatics
1061 papers in training set
Top 9%
0.8%
24
G3 Genes|Genomes|Genetics
351 papers in training set
Top 3%
0.7%
25
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.7%
26
European Journal of Human Genetics
49 papers in training set
Top 1%
0.7%
27
PLOS Biology
408 papers in training set
Top 22%
0.7%
28
GigaScience
172 papers in training set
Top 4%
0.5%