Back

Phylogenomic synteny analysis tracks conserved ancient polyploid-derived triplicated genomic blocks across Asteraceae genomes

Feng, T.; McKibben, M.; Lovell, J.; Michelmore, R.; Rieseberg, L.; Barker, M.; Schranz, M. E.

2025-01-11 genomics
10.1101/2025.01.08.631874 bioRxiv
Show abstract

The Asteraceae (Compositae) is the largest flowering plant family, ubiquitous in most terrestrial communities, and morphologically hyper-diverse. An ancient whole genome triplication (paleo-hexaploidization) occurred at approximately the same time as the evolutionary innovation and adaptive radiation of the family during the middle Eocene. Despite its importance, the genomic contents arising from this triplication have yet to be tracked in context of the Asteraceae genome evolution. We applied a synteny oriented phylogenomic analysis of 21 Asterales genomes and to study the paleo-hexaploidization and its consequences to gene, trait, and genome evolution. We identified 15 ancestral linkage groups (ALGs) that date back to the common diploid ancestor of all Asteraceae. Each of these groups was triplicated, resulting in 45 genomic blocks (3x15), which serve as the foundation for cross-family analyses. We demonstrate the complex evolutionary dynamics of the 45 genomic blocks across the Asteraceae phylogeny. We found that modern genomes are genetic mosaics of three progenitor genomes by extensive genomic exchange, chromosomal shuffling and gene fractionation. 157 genes retained three paleo-hexaploid derived syntenic paralogs across most Asteraceae species. Transcription factors (TFs) and auxin-related genes are significantly overrepresented in the conserved triplets, and expression of the paleo-hexaploidy paralogs is spatiotemporally differentiated. These genes are involved in the development of floral capitulum, a remarkable morphological innovation of the family. The discovery of conserved triplicated genes can direct further study to understand the evolutionary innovation, and the synteny-phylogenomic framework and ALGs provide a comparative framework to characterize newly sequenced Asteraceae genomes.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Molecular Plant
36 papers in training set
Top 0.1%
22.7%
2
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 0.5%
10.2%
3
Journal of Genetics and Genomics
36 papers in training set
Top 0.1%
8.5%
4
Nature Communications
4913 papers in training set
Top 28%
6.4%
5
The Plant Journal
197 papers in training set
Top 1.0%
4.9%
50% of probability mass above
6
Plant Communications
35 papers in training set
Top 0.2%
4.9%
7
Horticulture Research
43 papers in training set
Top 0.5%
3.6%
8
Nature Plants
84 papers in training set
Top 0.5%
3.6%
9
Genome Biology
555 papers in training set
Top 3%
2.4%
10
eLife
5422 papers in training set
Top 34%
2.4%
11
Cell Reports
1338 papers in training set
Top 20%
2.1%
12
Cell Genomics
162 papers in training set
Top 2%
2.1%
13
New Phytologist
309 papers in training set
Top 3%
2.1%
14
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 27%
2.1%
15
Communications Biology
886 papers in training set
Top 8%
1.7%
16
The Plant Cell
141 papers in training set
Top 1%
1.7%
17
Nature Genetics
240 papers in training set
Top 5%
1.5%
18
Cell Discovery
54 papers in training set
Top 4%
1.2%
19
Cell
370 papers in training set
Top 14%
1.2%
20
Plant Biotechnology Journal
56 papers in training set
Top 1.0%
0.9%
21
Protein & Cell
25 papers in training set
Top 2%
0.8%
22
National Science Review
22 papers in training set
Top 2%
0.8%
23
Science Advances
1098 papers in training set
Top 29%
0.8%
24
Molecular Biology and Evolution
488 papers in training set
Top 5%
0.7%
25
Scientific Reports
3102 papers in training set
Top 76%
0.7%
26
PNAS Nexus
147 papers in training set
Top 3%
0.6%
27
PLOS ONE
4510 papers in training set
Top 73%
0.5%
28
PLOS Genetics
756 papers in training set
Top 18%
0.5%
29
Science
429 papers in training set
Top 22%
0.5%