Back

Chromosome-scale Salvia hispanica L. (Chia) genome assembly reveals rampant Salvia interspecies introgression

Brose, J.; Hamilton, J. P.; Schlecht, N.; Zhao, D.; Mejia-Ponce, P. M.; Cruz Perez, A.; Vaillancourt, B.; Wood, J. C.; Edger, P. P.; Montes-Hernandez, S.; Orozco de Rosas, G.; Hamberger, B.; Cibrian Jaramillo, A.; Buell, C. R.

2024-06-17 genomics
10.1101/2024.06.14.598901 bioRxiv
Show abstract

Salvia hispanica L. (Chia), a member of the Lamiaceae, is an economically important crop in Mesoamerica, with health benefits associated with its seed fatty acid composition. Chia varieties are distinguished based on seed color including mixed white and black (Chia pinta) and black (Chia negra). To facilitate research on Chia and expand on comparative analyses within the Lamiaceae, we generated a chromosome-scale assembly of a Chia pinta accession and performed comparative genome analyses with a previously published Chia negra genome assembly. The Chia pinta and negra genome sequences were highly similar as shown by a limited number of single nucleotide polymorphisms and extensive shared orthologous gene membership. There is an enrichment of terpene synthases in the Chia pinta genome relative to the Chia negra genome. We sequenced and analyzed the genomes of 20 Chia accessions with differing seed color and geographic origin revealing population structure within S. hispanica and interspecific introgressions of Salvia species. As the genus Salvia is polyphyletic, its evolutionary history remains unclear. Using large-scale synteny analysis within the Lamiaceae and orthologous group membership, we resolved the phylogeny of Salvia species. This study and its collective resources further our understanding of genomic diversity in this food crop and the extent of inter-species hybridizations in Salvia. PLAIN LANGUAGE SUMMARYChia pinta is an economically important crop due to the high fatty acid present in the seeds. There are multiple types of Chia based on the seeds color including mixed which and black (Chia pinta), black (Chia negra), and white (Chia blanca). We generated a genome assembly of Chia pinta and compared it to existing genome assemblies. While the assemblies are highly similar there are key differences in terpene synthase composition between Chia pinta and Chia negra. We also sequenced 20 other Chia accessions with different seed color and geographic origin to determine a population structure within Chia. We generated genomic resources to further our understanding of this food crop. ABBREVIATIONSBGC Biosynthetic gene cluster BUSCO Benchmarking Universal Single Copy Orthologs GO Gene ontology SNP Single nucleotide polymorphism TIR Terminal inverted repeat TPS Terpene synthase WGS Whole genome shotgun

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Plant Direct
81 papers in training set
Top 0.1%
27.9%
2
Frontiers in Plant Science
240 papers in training set
Top 0.9%
10.2%
3
G3 Genes|Genomes|Genetics
351 papers in training set
Top 0.3%
6.4%
4
The Plant Genome
53 papers in training set
Top 0.1%
6.4%
50% of probability mass above
5
PLANTS, PEOPLE, PLANET
21 papers in training set
Top 0.1%
4.9%
6
Scientific Reports
3102 papers in training set
Top 31%
4.0%
7
G3
33 papers in training set
Top 0.1%
3.6%
8
Applications in Plant Sciences
21 papers in training set
Top 0.1%
3.6%
9
Gigabyte
60 papers in training set
Top 0.3%
3.3%
10
The Plant Journal
197 papers in training set
Top 2%
3.3%
11
PLOS ONE
4510 papers in training set
Top 42%
3.1%
12
Plant Biotechnology Journal
56 papers in training set
Top 0.5%
2.4%
13
Scientific Data
174 papers in training set
Top 0.9%
1.9%
14
G3: Genes, Genomes, Genetics
222 papers in training set
Top 0.4%
1.7%
15
Horticulture Research
43 papers in training set
Top 1%
1.5%
16
New Phytologist
309 papers in training set
Top 4%
1.3%
17
DNA Research
23 papers in training set
Top 0.3%
1.3%
18
BMC Plant Biology
47 papers in training set
Top 0.8%
0.9%
19
Genetics
225 papers in training set
Top 4%
0.8%
20
Frontiers in Genetics
197 papers in training set
Top 10%
0.8%
21
Genes
126 papers in training set
Top 3%
0.8%
22
Agronomy
18 papers in training set
Top 0.9%
0.7%
23
Annals of Botany
43 papers in training set
Top 0.4%
0.7%
24
Plant Communications
35 papers in training set
Top 2%
0.6%