Back

The reference genome of an endangered Asteraceae, Deinandra increscens subsp. villosa, endemic to the Central Coast of California

McEvoy, S. L.; Meyer, R. S.; Hasenstab-Lehman, K. E.; Guilliams, C. M.

2024-02-26 genomics
10.1101/2024.02.25.582000 bioRxiv
Show abstract

We present a high-quality reference genome of the federally endangered Gaviota tarplant, Deinandra increscens subsp. villosa (Madiinae, Asteraceae), an annual herb endemic to the Central California coast. Stewards of remaining populations have planned to apply conservation strategies informed by whole genome approaches. Generating PacBio Hifi, Oxford Nanopore Technologies, and Dovetail Omni-C data, we assembled a genome of 1.67 Gbp as 28.7 K scaffolds with a scaffold N50 of 74.9 Mb. BUSCO completeness for the final assembly was 98.1% with 15.7% duplicate copies. We annotated repeat content in 74.8% of the genome. Long terminal repeats (LTR) covered 44.0% of the genome with Copia families predominant at 22.9% followed by Gypsy at 14.2%. Both Gypsy and Copia elements were common in ancestral peaks of LTR, and the most abundant element was a Gypsy element containing nested Copia/Angela sequenced similarity, reflecting a complex evolutionary history of repeat activity. Gene annotation produced 41,039 genes and 69,563 transcripts, of which >99% were functionally annotated. BUSCO duplication rates remained very high with proteins at 50.4% complete duplicates and 46.0% single copy. Whole genome duplication (WGD) synonymous mutation rates of Gaviota tarplant and sunflower (Helianthus annuus) shared peaks that correspond to the last Asteraceae polyploidization event and subsequent divergence from a common ancestor at [~]27 mya. Tandem genes were twice as prevalent as WGD genes suggesting tandem genes could be an important strategy of environmental adaptation in this species. Article SummaryWe introduce a high-quality reference genome for the endangered Gaviota tarplant. The assembly is 1.67 Gbp with 98.1% BUSCO completeness and 41 K annotated genes. We find extensive Copia long terminal repeat sequences and tandem genes that suggest environmental adaptation strategies. Comparisons with sunflower suggest a shared polyploidization event around 27 million years ago, close to the date of the common ancestor divergence. This work underlines the importance of genomic studies in accurately understanding adaptations and conservation needs.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Frontiers in Plant Science
240 papers in training set
Top 0.2%
22.3%
2
G3: Genes, Genomes, Genetics
222 papers in training set
Top 0.1%
12.2%
3
Gigabyte
60 papers in training set
Top 0.1%
10.3%
4
Applications in Plant Sciences
21 papers in training set
Top 0.1%
6.7%
50% of probability mass above
5
The Plant Journal
197 papers in training set
Top 1%
3.9%
6
PLOS ONE
4510 papers in training set
Top 40%
3.6%
7
The Plant Genome
53 papers in training set
Top 0.2%
3.6%
8
Plant Direct
81 papers in training set
Top 0.7%
3.0%
9
G3 Genes|Genomes|Genetics
351 papers in training set
Top 0.8%
2.7%
10
G3
33 papers in training set
Top 0.1%
2.1%
11
Genome Biology and Evolution
280 papers in training set
Top 0.9%
1.9%
12
DNA Research
23 papers in training set
Top 0.3%
1.7%
13
Scientific Reports
3102 papers in training set
Top 59%
1.7%
14
PLANTS, PEOPLE, PLANET
21 papers in training set
Top 0.4%
1.7%
15
Annals of Botany
43 papers in training set
Top 0.3%
1.6%
16
Genes
126 papers in training set
Top 1%
1.5%
17
PeerJ
261 papers in training set
Top 9%
1.3%
18
BMC Genomics
328 papers in training set
Top 4%
1.2%
19
Frontiers in Genetics
197 papers in training set
Top 8%
0.9%
20
New Phytologist
309 papers in training set
Top 5%
0.7%
21
Ecology and Evolution
232 papers in training set
Top 4%
0.7%
22
Nature Communications
4913 papers in training set
Top 64%
0.7%
23
Scientific Data
174 papers in training set
Top 3%
0.7%
24
Frontiers in Marine Science
55 papers in training set
Top 1%
0.6%