Back

Diatom Endosymbionts have Shrinking but Stable Genomes Despite Low Coding Density

Abresch, H.; Cruse, F.; Miller, S.; Li, J.; Hamsher, S.; Kociolek, J. P.; Miller, S. R.

2026-03-22 evolutionary biology
10.64898/2026.03.19.712447 bioRxiv
Show abstract

Successful establishment of long-term, obligate endosymbiotic relationships requires integration of hosts and endosymbionts across multiple levels. For example, highly integrated, host-beneficial endosymbionts typically have extremely reduced genomes and metabolisms. However, we do not yet fully understand the specific mechanisms that drive this integration or if there is a specific order in which these changes must occur. To investigate the early stages of endosymbiont genome reduction, we greatly expanded available whole genome data for the nitrogen-fixing endosymbionts (spheroid bodies, SBs) of diatoms in the family Rhopalodiaceae. We used these data to reconstruct SB evolutionary history and to characterize SB core metabolic capacity. We found two key genes missing from all SB genomes, mltA and dnaA, which could provide points of host control over SB cell division. Although most of the SB core genome is experiencing moderately strong purifying selection, we identified 54 genes under positive selection. Eighteen of these are peripheral proteins or involved in cell wall and cell membrane metabolism and could be involved in direct interactions with the host. Unexpectedly, we also found three nif genes under positive selection that are core to the central nitrogen-fixing enzyme. Overall, our results provide early insights into how SBs and their hosts interact, showing that SBs are still in the early stages of endosymbiont genome reduction, but they differ in key ways from current models, including the early loss of all mobile elements.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Genome Biology and Evolution
280 papers in training set
Top 0.1%
14.4%
2
PLOS Genetics
756 papers in training set
Top 1%
9.9%
3
mBio
750 papers in training set
Top 2%
8.2%
4
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 9%
7.0%
5
Molecular Biology and Evolution
488 papers in training set
Top 0.8%
6.2%
6
The Plant Cell
141 papers in training set
Top 0.6%
6.2%
50% of probability mass above
7
The ISME Journal
194 papers in training set
Top 0.5%
4.7%
8
Nature Communications
4913 papers in training set
Top 36%
4.2%
9
Current Biology
596 papers in training set
Top 5%
3.9%
10
Genome Biology
555 papers in training set
Top 2%
3.5%
11
eLife
5422 papers in training set
Top 30%
3.0%
12
New Phytologist
309 papers in training set
Top 3%
2.0%
13
Science
429 papers in training set
Top 14%
1.7%
14
Genome Research
409 papers in training set
Top 2%
1.6%
15
mSystems
361 papers in training set
Top 6%
1.3%
16
PLOS Biology
408 papers in training set
Top 13%
1.3%
17
Molecular Ecology
304 papers in training set
Top 3%
1.2%
18
BMC Biology
248 papers in training set
Top 2%
1.2%
19
Nature Ecology & Evolution
113 papers in training set
Top 3%
1.1%
20
Science Advances
1098 papers in training set
Top 27%
0.9%
21
Evolution Letters
71 papers in training set
Top 2%
0.8%
22
PLOS ONE
4510 papers in training set
Top 69%
0.7%
23
Cell Reports
1338 papers in training set
Top 34%
0.7%
24
PLOS Pathogens
721 papers in training set
Top 9%
0.7%
25
PLOS Computational Biology
1633 papers in training set
Top 28%
0.6%
26
Cell
370 papers in training set
Top 19%
0.6%
27
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 7%
0.6%
28
The Plant Journal
197 papers in training set
Top 4%
0.6%