Back

ORFanes in mitochondrial genomes of marine polychaete Polydora

Selifanova, M.; Demianchenko, O.; Noskova, E.; Pitikov, E.; Skvortsov, D.; Drozd, J.; Vatolkina, N.; Apel, P.; Kolodyazhnaya, E.; Ezhova, M. A.; Tzetlin, A. B.; Neretina, T. V.; Knorre, D. A.

2023-02-04 evolutionary biology
10.1101/2023.02.04.527105 bioRxiv
Show abstract

Most characterised metazoan mitochondrial genomes are compact and encode a small set of proteins that are essential for oxidative phosphorylation. However, in rare cases, invertebrate taxa have additional open reading frames (ORFs) in their mtDNA sequences. Here, we sequenced and analysed the mitochondrial genome of a polychaete worm, Polydora cf. ciliata, part of whose life cycle takes place in low-oxygen conditions. In the mitogenome, we found three "ORFane" regions (1063, 427, and 519 bp) that have no resemblance to any standard metazoan mtDNA gene but lack stop codons in one of the reading frames. Similar regions are found in the mitochondrial genomes of three other Polydora species and Bocardiella hamata. All five species share the same gene order in their mitogenomes, which differ from that of other known spionidae mitogenomes. By analysing the ORFane sequences, we found that they are under negative selection pressure, contain conservative regions, and harbour predicted transmembrane domains.The codon adaptation indices (CAIs) of the ORFan genes were in the same range of values as the CAI of conventional protein-coding genes in corresponding mitochondrial genomes. Together, this suggests that ORFanes encode functional proteins. We speculate that the ORFanes originated from the conventional mitochondrial protein-coding genes which were duplicated when the Polydora/Bocardiella species complex separated from the rest of the Spionidae. Significance statementMetazoan mitochondrial genomes usually contain a conservative set of genes and features. However, mitogenomes of some species contain ORFanes - putative protein-coding genes without clear homology with other known sequences. In this study, we analysed three ORFanes in mitochondria of species of the genera Polydora and Bocardiella, which were absent in all other representatives of Spionidae. To the best of our knowledge, ORFanes havent been described in Annelida before. Sequence analysis of the ORFanes suggests they contain conservative regions and are likely translated into functional proteins. Our study features an uncommon case where new protein-coding genes emerged in the mitochondrial genomes of metazoa.

Matching journals

The top 1 journal accounts for 50% of the predicted probability mass.

1
Genome Biology and Evolution
280 papers in training set
Top 0.1%
54.3%
50% of probability mass above
2
Molecular Biology and Evolution
488 papers in training set
Top 0.9%
5.1%
3
Molecular Phylogenetics and Evolution
61 papers in training set
Top 0.1%
4.1%
4
Molecular Ecology
304 papers in training set
Top 1%
4.1%
5
Proceedings of the Royal Society B: Biological Sciences
341 papers in training set
Top 3%
2.5%
6
BMC Biology
248 papers in training set
Top 0.8%
2.0%
7
BMC Ecology and Evolution
49 papers in training set
Top 0.9%
1.8%
8
PeerJ
261 papers in training set
Top 7%
1.7%
9
PLOS Biology
408 papers in training set
Top 9%
1.7%
10
Frontiers in Ecology and Evolution
60 papers in training set
Top 2%
1.4%
11
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 35%
1.4%
12
Evolution Letters
71 papers in training set
Top 1%
1.4%
13
eLife
5422 papers in training set
Top 48%
1.3%
14
BMC Genomics
328 papers in training set
Top 4%
1.2%
15
Communications Biology
886 papers in training set
Top 16%
1.0%
16
Journal of Evolutionary Biology
98 papers in training set
Top 0.8%
0.9%
17
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 5%
0.8%
18
Scientific Reports
3102 papers in training set
Top 73%
0.8%
19
Peer Community Journal
254 papers in training set
Top 4%
0.8%
20
Nature Communications
4913 papers in training set
Top 61%
0.8%
21
Journal of Molecular Evolution
21 papers in training set
Top 0.3%
0.8%
22
PLOS Genetics
756 papers in training set
Top 18%
0.5%
23
Open Biology
95 papers in training set
Top 3%
0.5%