Back

The draft genome sequence of Eucalyptus polybractea based on hybrid assembly with short- and long-reads reads

Li, T.; Kainer, D.; Foley, W. J.; Rodrigo, A.; Kuelheim, C.

2021-05-18 genomics
10.1101/2021.05.18.444652 bioRxiv
Show abstract

Eucalyptus polybractea is a small, multi-stemmed tree, which is widely cultivated in Australia for the production of Eucalyptus oil. We report the hybrid assembly of the E. polybractea genome utilizing both short- and long-read technology. We generated 44 Gb of Illumina HiSeq short reads and 8 Gb of Nanopore long reads, representing approximately 83x and 15x genome coverage, respectively. The hybrid-assembled genome, after polishing, contained 24,864 scaffolds with an accumulated length of 523 Mb (N50 = 40.3 kb; BUSCO-calculated genome completeness of 94.3%). The genome contained 35,385 predicted protein-coding genes detected by combining homology-based and de novo approaches. We have provided the first assembled genome based on hybrid sequences from the highly diverse Eucalyptus subgenus Symphyomyrtus, and revealed the value of including long-reads from Nanopore technology for enhancing the contiguity of the assembled genome, as well as for improving its completeness. We anticipate that the E. polybractea genome will be an invaluable resource supporting a range of studies in genetics, population genomics and evolution of related species in Eucalyptus.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
DNA Research
23 papers in training set
Top 0.1%
19.1%
2
Gigabyte
60 papers in training set
Top 0.1%
10.6%
3
Scientific Reports
3102 papers in training set
Top 6%
10.3%
4
The Plant Journal
197 papers in training set
Top 0.5%
8.6%
5
PLOS ONE
4510 papers in training set
Top 23%
7.3%
50% of probability mass above
6
Plant Biotechnology Journal
56 papers in training set
Top 0.2%
4.4%
7
Frontiers in Plant Science
240 papers in training set
Top 2%
3.7%
8
Scientific Data
174 papers in training set
Top 0.6%
3.1%
9
G3 Genes|Genomes|Genetics
351 papers in training set
Top 0.9%
2.5%
10
Frontiers in Genetics
197 papers in training set
Top 4%
1.8%
11
Plant Direct
81 papers in training set
Top 1%
1.7%
12
PeerJ
261 papers in training set
Top 7%
1.7%
13
Horticulture Research
43 papers in training set
Top 1%
1.5%
14
Genes
126 papers in training set
Top 1%
1.5%
15
Nature Communications
4913 papers in training set
Top 54%
1.4%
16
BMC Genomics
328 papers in training set
Top 3%
1.4%
17
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 4%
1.2%
18
Microbiology Resource Announcements
22 papers in training set
Top 0.6%
1.0%
19
The Plant Genome
53 papers in training set
Top 0.5%
0.9%
20
International Journal of Molecular Sciences
453 papers in training set
Top 14%
0.8%
21
Molecular Ecology Resources
161 papers in training set
Top 1.0%
0.8%
22
Genomics
60 papers in training set
Top 2%
0.8%
23
G3: Genes, Genomes, Genetics
222 papers in training set
Top 0.9%
0.8%
24
International Journal of Biological Macromolecules
65 papers in training set
Top 4%
0.7%
25
Genome Biology
555 papers in training set
Top 8%
0.7%
26
Plant Communications
35 papers in training set
Top 2%
0.5%
27
G3
33 papers in training set
Top 0.7%
0.5%
28
Communications Biology
886 papers in training set
Top 31%
0.5%
29
Journal of Genetics and Genomics
36 papers in training set
Top 3%
0.5%
30
GigaScience
172 papers in training set
Top 4%
0.5%