Back

Evolutionary persistence of a highly prevalent multicopy mitochondrial-derived nuclear insertion (Mega-NUMT) in Neotropical Drosophila flies

Montoliu-Nerin, M.; Strunov, A.; Heyworth, E.; Schneider, D. I.; Thoma, J.; Hua-Van, A.; Courret, C.; Klasson, L. J.; Miller, W. J.

2026-04-01 evolutionary biology
10.64898/2026.03.31.715258 bioRxiv
Show abstract

BackgroundAlthough strict maternal transmission of mitochondria is a general feature of animals and humans for ensuring homogeneity in mitochondrial DNA (mtDNA) across generations, exceptions were reported in the recent past. For example, some extremely rare but spectacular cases of heteroplasmy and paternal transmission in humans have questioned the universal evolutionary principle. Hence, as an alternative, the Mega-NUMT concept was coined to explain this discovery and was thereafter partly proven to exist. This concept expands on the quite common transfer of mtDNA fragments to the nucleus (NUMTs) by considering the existence of multicopy mitochondrial nuclear insertions. Mega-NUMT reports are currently restricted to a few cases in animals, including humans. However, even in humans, their detailed genomic organization, natural prevalence, and potential biological functions remain unclear. Methodology/Principal FindingsHere, we discovered that up to 60 full-sized mitochondrial genomes are integrated into the nuclear genome of the neotropical fruit fly Drosophila paulistorum using long-read sequencing and confirmed their presence by in situ hybridization. The copies are organized in one cluster on chromosome 3, which we, due to its similarity with the Mega-NUMT concept, designated the "Dpau Mega-NUMT". Contrary to the rarity in humans, this Mega-NUMT is found at high prevalence (40%) in both long-term laboratory lines and natural D. paulistorum populations of different semispecies. Additionally, the mitochondrial copies in the Mega-NUMT cluster are phylogenetically separated from the current mitotypes of D. paulistorum. Together, these observations suggest long-term maintenance of the Mega-NUMT in nature. Hence, we propose that the Dpau Mega-NUMT may have been transferred to the nuclear genome before D. paulistorum semispecies radiation and maintained at relatively high prevalence in nature by balancing selection due to yet undetermined functions. Conclusions/SignificanceTo our knowledge, this is the first verified existence and detailed dissection of a Mega-NUMT outside cats and humans. We show that Mega-NUMTs can be persistent in nature, even at high prevalence, potentially due to balancing selection. Our findings strengthen the importance of high-quality long-read sequencing technologies for deciphering complex repeat-rich genomic regions to deepen our understanding of the dynamics of genome evolution within genomic "dark matter".

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Genome Biology and Evolution
280 papers in training set
Top 0.1%
26.4%
2
Molecular Ecology
304 papers in training set
Top 0.9%
6.9%
3
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 0.7%
4.9%
4
PLOS ONE
4510 papers in training set
Top 35%
4.0%
5
Biology Letters
66 papers in training set
Top 0.1%
3.7%
6
Genes
126 papers in training set
Top 0.2%
3.7%
7
BMC Biology
248 papers in training set
Top 0.4%
3.1%
50% of probability mass above
8
PLOS Genetics
756 papers in training set
Top 6%
2.7%
9
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 25%
2.7%
10
Scientific Reports
3102 papers in training set
Top 49%
2.1%
11
Molecular Biology and Evolution
488 papers in training set
Top 2%
2.1%
12
Frontiers in Ecology and Evolution
60 papers in training set
Top 2%
2.1%
13
PeerJ
261 papers in training set
Top 6%
1.8%
14
Open Biology
95 papers in training set
Top 0.6%
1.7%
15
Journal of Evolutionary Biology
98 papers in training set
Top 0.5%
1.7%
16
Molecular Phylogenetics and Evolution
61 papers in training set
Top 0.2%
1.7%
17
BMC Ecology and Evolution
49 papers in training set
Top 0.9%
1.7%
18
Ecology and Evolution
232 papers in training set
Top 2%
1.7%
19
eLife
5422 papers in training set
Top 51%
1.0%
20
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 7%
1.0%
21
PLOS Biology
408 papers in training set
Top 16%
0.9%
22
Journal of Heredity
35 papers in training set
Top 0.2%
0.8%
23
Peer Community Journal
254 papers in training set
Top 4%
0.8%
24
iScience
1063 papers in training set
Top 31%
0.8%
25
Proceedings of the Royal Society B: Biological Sciences
341 papers in training set
Top 6%
0.8%
26
Mobile DNA
27 papers in training set
Top 0.2%
0.8%
27
BMC Genomics
328 papers in training set
Top 6%
0.7%
28
Communications Biology
886 papers in training set
Top 25%
0.7%
29
Biology Open
130 papers in training set
Top 3%
0.7%
30
Nature Communications
4913 papers in training set
Top 65%
0.7%