Back

Uniform annotation framework reveals genome size and LINE/LTR retrotransposons as predictors of gene family expansion across Coleoptera

Trabert, M.; Boman, J.; Immonen, E.

2026-03-27 evolutionary biology
10.64898/2026.03.25.714136 bioRxiv
Show abstract

Gene family evolution - the turnover of duplicated homologous genes - shapes genome architecture and fuels phenotypic innovation. Repetitive elements (REs) facilitate gene duplication and genome expansion, yet whether variation in repeat abundance and genome size (GS) scales with gene family evolution across species remains unclear. Coleoptera provides a well-suited system for examining these dynamics because of its major ecological diversification and extensive genome size variation. Comparative tests of these relationships are however hindered by heterogenous genome annotations that distort gene counts and orthogroup assignments. We first evaluate how repeat- and gene-annotation strategies influence gene- and orthogroup-detection across beetle genomes. We then apply unified re-annotations of both to identify rapidly evolving gene families and test whether GS and repeat content covary with gene family size evolution. Nearly 500 orthogroups are rapidly evolving in Coleoptera, many of which are linked to ecologically crucial functions such as chemosensory perception and detoxification. GS and RE abundance are correlated, and on average scale positively with gene family sizes. LINE and DNA transposable elements commonly flank rapidly expanding gene families, but with pronounced species-specific variation. Together, these findings position genome architecture and repeat dynamics as fundamental determinants of gene family evolvability.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Nature Ecology & Evolution
113 papers in training set
Top 0.2%
13.8%
2
Molecular Biology and Evolution
488 papers in training set
Top 0.4%
9.7%
3
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 6%
9.7%
4
PLOS Biology
408 papers in training set
Top 0.7%
6.9%
5
eLife
5422 papers in training set
Top 15%
6.1%
6
Science
429 papers in training set
Top 6%
6.1%
50% of probability mass above
7
PLOS Genetics
756 papers in training set
Top 3%
4.7%
8
Current Biology
596 papers in training set
Top 5%
4.0%
9
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 1%
3.8%
10
Evolution Letters
71 papers in training set
Top 0.7%
3.4%
11
Nature Communications
4913 papers in training set
Top 41%
3.4%
12
Molecular Ecology
304 papers in training set
Top 2%
3.0%
13
The Plant Cell
141 papers in training set
Top 1%
1.7%
14
Science Advances
1098 papers in training set
Top 19%
1.6%
15
Cell Host & Microbe
113 papers in training set
Top 3%
1.4%
16
Genome Biology and Evolution
280 papers in training set
Top 1%
1.4%
17
BMC Biology
248 papers in training set
Top 2%
1.3%
18
Genetics
225 papers in training set
Top 3%
1.2%
19
Genome Research
409 papers in training set
Top 3%
1.1%
20
Proceedings of the Royal Society B: Biological Sciences
341 papers in training set
Top 6%
0.9%
21
Evolution
199 papers in training set
Top 2%
0.9%
22
The American Naturalist
114 papers in training set
Top 2%
0.9%
23
New Phytologist
309 papers in training set
Top 5%
0.8%
24
Cell
370 papers in training set
Top 16%
0.8%
25
Nature Genetics
240 papers in training set
Top 7%
0.8%
26
Developmental Cell
168 papers in training set
Top 12%
0.7%