Back

Improvements to the Gulf Pipefish Syngnathus scovelli Genome

Ramesh, B.; Small, C.; Healey, H.; Johnson, B.; Barker, E.; Currey, M.; Bassham, S.; Myers, M.; Cresko, W.; Jones, A.

2023-01-24 genomics
10.1101/2023.01.23.525209 bioRxiv
Show abstract

The Gulf pipefish Syngnathus scovelli has emerged as an important species in the study of sexual selection, development, and physiology, among other topics. The fish family Syngnathidae, which includes pipefishes, seahorses, and seadragons, has become an increasingly attractive target for comparative research in ecological and evolutionary genomics. These endeavors depend on having a high-quality genome assembly and annotation. However, the first version of the S. scovelli genome assembly was generated by short-read sequencing and annotated using a small set of RNA-sequence data, resulting in limited contiguity and a relatively poor annotation. Here, we present an improved genome assembly and an enhanced annotation, resulting in a new official gene set for S. scovelli. By using PacBio long-read high-fidelity (Hi-Fi) sequences and a proximity ligation (Hi-C) library, we fill small gaps and join the contigs to obtain 22 chromosome-level scaffolds. Compared to the previously published genome, the gaps in our novel genome assembly are smaller, the N75 is much larger (13.3 Mb), and this new genome is around 95% BUSCO complete. The precision of the gene models in the NCBIs eukaryotic annotation pipeline was enhanced by using a large body of RNA-Seq reads from different tissue types, leading to the discovery of 28,162 genes, of which 8,061 were non-coding genes. This new genome assembly and the annotation are tagged as a RefSeq genome by NCBI and thus provide substantially enhanced genomic resources for future research involving S. scovelli.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
DNA Research
23 papers in training set
Top 0.1%
25.8%
2
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 0.2%
18.5%
3
BMC Genomics
328 papers in training set
Top 0.4%
4.8%
4
Gigabyte
60 papers in training set
Top 0.2%
4.8%
50% of probability mass above
5
Frontiers in Genetics
197 papers in training set
Top 2%
3.6%
6
Molecular Ecology Resources
161 papers in training set
Top 0.3%
3.6%
7
Journal of Genetics and Genomics
36 papers in training set
Top 0.4%
3.2%
8
Scientific Reports
3102 papers in training set
Top 40%
3.2%
9
Genomics
60 papers in training set
Top 0.7%
2.1%
10
Scientific Data
174 papers in training set
Top 1.0%
1.8%
11
Aquaculture
29 papers in training set
Top 0.3%
1.7%
12
Genome Biology and Evolution
280 papers in training set
Top 1%
1.7%
13
G3 Genes|Genomes|Genetics
351 papers in training set
Top 1%
1.7%
14
Microbiology Resource Announcements
22 papers in training set
Top 0.4%
1.7%
15
Genes
126 papers in training set
Top 1%
1.5%
16
Science China Life Sciences
26 papers in training set
Top 1%
1.3%
17
PeerJ
261 papers in training set
Top 10%
1.2%
18
BMC Biology
248 papers in training set
Top 2%
1.2%
19
PLOS ONE
4510 papers in training set
Top 63%
0.9%
20
GigaScience
172 papers in training set
Top 2%
0.9%
21
eLife
5422 papers in training set
Top 56%
0.8%
22
Molecular Biology and Evolution
488 papers in training set
Top 4%
0.8%
23
Communications Biology
886 papers in training set
Top 26%
0.7%
24
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.6%
25
Nature Communications
4913 papers in training set
Top 66%
0.6%