Back

Inference of a genome-wide protein-coding gene set of the inshore hagfish Eptatretus burgeri

Yamaguchi, K.; Hara, Y.; Kaori, T.; Nishimura, O.; Smith, J.; Kadota, M.; Kuraku, S.

2020-07-26 genomics
10.1101/2020.07.24.218818 bioRxiv
Show abstract

The group of hagfishes (Myxiniformes) arose from agnathan (jawless vertebrate) lineages and is one of the only two extant cyclostome taxa, together with lampreys (Petromyzontiformes). Even though whole genome sequencing has been achieved for diverse vertebrate taxa, genome-wide sequence information has been highly limited for cyclostomes. Here we sequenced the genome of the inshore hagfish Eptatretus burgeri using DNA extracted from the testis, with a short-read sequencing platform, aiming at reconstructing a high-coverage coding gene catalogue. The obtained genome assembly, scaffolded with mate-pair reads and paired RNA-seq reads, exhibited an N50 scaffold length of 293 Kbp, which allowed the genome-wide prediction of coding genes. This computation resulted in the gene models whose completeness was estimated at the complete coverage of more than 83 % and the partial coverage of more than 93 % by referring to evolutionarily conserved single-copy orthologs. The high contiguity of the assembly and completeness of resulting gene models promises a high utility in various comparative analyses including phylogenomics and phylome exploration.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
DNA Research
23 papers in training set
Top 0.1%
14.5%
2
Scientific Reports
3102 papers in training set
Top 4%
12.2%
3
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 0.8%
8.3%
4
Gigabyte
60 papers in training set
Top 0.1%
6.2%
5
Frontiers in Genetics
197 papers in training set
Top 2%
3.9%
6
PeerJ
261 papers in training set
Top 2%
3.9%
7
BMC Genomics
328 papers in training set
Top 0.9%
3.5%
50% of probability mass above
8
Genomics
60 papers in training set
Top 0.4%
3.5%
9
GigaScience
172 papers in training set
Top 0.5%
3.5%
10
Genes
126 papers in training set
Top 0.4%
2.8%
11
Journal of Genetics and Genomics
36 papers in training set
Top 0.5%
2.7%
12
International Journal of Molecular Sciences
453 papers in training set
Top 4%
2.4%
13
International Journal of Biological Macromolecules
65 papers in training set
Top 1.0%
2.4%
14
Molecular Ecology Resources
161 papers in training set
Top 0.5%
2.0%
15
Scientific Data
174 papers in training set
Top 1.0%
1.8%
16
Communications Biology
886 papers in training set
Top 10%
1.6%
17
Genome Biology and Evolution
280 papers in training set
Top 1%
1.5%
18
PLOS ONE
4510 papers in training set
Top 59%
1.3%
19
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 6%
1.2%
20
Microbiology Resource Announcements
22 papers in training set
Top 0.5%
1.2%
21
Journal of Molecular Evolution
21 papers in training set
Top 0.3%
0.9%
22
Heliyon
146 papers in training set
Top 5%
0.9%
23
Viruses
318 papers in training set
Top 5%
0.8%
24
G3 Genes|Genomes|Genetics
351 papers in training set
Top 2%
0.8%
25
Briefings in Bioinformatics
326 papers in training set
Top 7%
0.7%
26
F1000Research
79 papers in training set
Top 5%
0.7%
27
Frontiers in Microbiology
375 papers in training set
Top 10%
0.6%