
Chromosome-scale genome assembly of Eustoma grandiflorum, the first complete genome sequence in family Gentianaceae

Shirasawa, K.; Arimoto, R.; Hirakawa, H.; Ishimori, M.; Ghelfi, A.; Miyasaka, M.; Endo, M.; Kawabata, S.; Isobe, S.

2021-09-11 genomics
10.1101/2021.09.09.459690 bioRxiv

Eustoma grandiflorum (Raf.) Shinn. is an annual herbaceous plant native to the southern United States, Mexico, and the Greater Antilles. It bears large flowers in a variety of colors and is an important flower crop. In this study, we established a chromosome-scale de novo assembly of E. grandiflorum by integrating four genomic and genetic approaches: (1) Pacific Biosciences (PacBio) Sequel deep sequencing, (2) error correction of the assembly with Illumina short reads, (3) scaffolding by chromatin conformation capture sequencing (Hi-C), and (4) genetic linkage maps derived from an F2 mapping population. The resulting assembly comprises 36 pseudomolecules and 64 unplaced scaffolds, with a total length of 1,324.8 Mb. Full-length transcript sequences were obtained by PacBio Iso-Seq for gene prediction on the assembled genome, Egra_v1. A total of 36,619 genes were predicted on the genome as high-confidence (HC) genes. Of these 36,619 genes, 25,936 were assigned functional annotations by ZenAnnotation. Genetic diversity analysis was also performed on nine commercial E. grandiflorum varieties bred in Japan, and 254,205 variants were identified. This is the first report of a reference genome sequence for E. grandiflorum, as well as for the family Gentianaceae.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

| Rank | Journal | Papers in training set | Percentile | Predicted probability |
|---|---|---|---|---|
| 1 | DNA Research | 23 | Top 0.1% | 18.6% |
| 2 | PLOS ONE | 4510 | Top 15% | 12.5% |
| 3 | Genomics, Proteomics & Bioinformatics | 171 | Top 0.5% | 10.5% |
| 4 | Frontiers in Plant Science | 240 | Top 1% | 8.4% |
| | 50% of probability mass above | | | |
| 5 | Scientific Reports | 3102 | Top 14% | 6.8% |
| 6 | Journal of Genetics and Genomics | 36 | Top 0.2% | 4.9% |
| 7 | Horticulture Research | 43 | Top 0.5% | 3.6% |
| 8 | Gigabyte | 60 | Top 0.4% | 2.9% |
| 9 | Frontiers in Genetics | 197 | Top 4% | 2.1% |
| 10 | Plant Direct | 81 | Top 0.9% | 2.1% |
| 11 | The Plant Journal | 197 | Top 2% | 1.9% |
| 12 | Genes | 126 | Top 0.7% | 1.9% |
| 13 | New Phytologist | 309 | Top 4% | 1.3% |
| 14 | Genomics | 60 | Top 2% | 1.2% |
| 15 | Scientific Data | 174 | Top 2% | 1.0% |
| 16 | Plant Communications | 35 | Top 1% | 1.0% |
| 17 | BMC Genomics | 328 | Top 4% | 1.0% |
| 18 | International Journal of Molecular Sciences | 453 | Top 13% | 0.9% |
| 19 | International Journal of Biological Macromolecules | 65 | Top 3% | 0.9% |
| 20 | Plant Molecular Biology | 18 | Top 0.2% | 0.9% |
| 21 | PeerJ | 261 | Top 14% | 0.8% |
| 22 | Molecular Plant | 36 | Top 1% | 0.7% |
| 23 | Microbiology Resource Announcements | 22 | Top 1.0% | 0.7% |
| 24 | eLife | 5422 | Top 61% | 0.6% |
| 25 | The Plant Genome | 53 | Top 0.7% | 0.6% |
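
The 50% cutoff stated above can be checked by accumulating the listed probabilities in rank order. The following is a minimal sketch of that arithmetic, not the site's own method: the function name `journals_to_reach` and the variable `PROBS` are hypothetical, and the values are copied from the 25 listed journals.

```python
from itertools import accumulate

# Predicted probabilities (%) of the 25 listed journals, in rank order.
PROBS = [18.6, 12.5, 10.5, 8.4, 6.8, 4.9, 3.6, 2.9, 2.1, 2.1,
         1.9, 1.9, 1.3, 1.2, 1.0, 1.0, 1.0, 0.9, 0.9, 0.9,
         0.8, 0.7, 0.7, 0.6, 0.6]


def journals_to_reach(probs, threshold):
    """Return how many top-ranked journals are needed for the cumulative
    probability (in %) to reach `threshold`, or None if it is never reached."""
    for count, total in enumerate(accumulate(probs), start=1):
        # Round to one decimal place to absorb floating-point error,
        # since the listed values carry one decimal.
        if round(total, 1) >= threshold:
            return count
    return None


if __name__ == "__main__":
    print(journals_to_reach(PROBS, 50.0))
    print(round(sum(PROBS), 1))
```

The listed probabilities do not sum to 100%, so journals beyond rank 25 presumably carry the remaining mass; a threshold above the listed total returns `None`.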