A draft de novo assembly of Diadema antillarum, a keystone herbivore of the Caribbean reefs

Majeske, A. J.; Wong, J.; Farkas Pool, C.; Eirin-Lopez, J.; Wolfsberger, W.; Schizas, N. V.; Diaz-Lameiro, A. M.; Castro-Marquez, S. O.; Hilkert, K.; Mercado Capote, A. J.; Oleksyk, T. K.

2026-05-27 genetics
10.64898/2026.05.24.727502 bioRxiv

We generated the first reference-level nuclear genome assembly of the long-spined black sea urchin, Diadema antillarum (Philippi, 1845), a keystone species of Caribbean reefs. Using whole-genome sequencing data from PacBio HiFi, Oxford Nanopore, and Illumina platforms, we employed multiple assembly strategies to generate a high-quality, near-complete genome. The final assembly spans 1.73 Gbp, consists of 2,964 scaffolds, and has an N50 of 1.56 Mbp. BUSCO analysis (metazoa_odb10) indicates 98.4% completeness. The genome displays a heterozygosity rate of 2.52% and contains 42.85% repetitive elements, of which 29.96% are unclassified. Coverage analysis reveals that while most of the genome was assembled at 11x depth, certain regions exhibit up to 530x coverage. Notably, regions exceeding 33x coverage account for 30.53% of the repetitive content, suggesting localized expansion of repeats. Duplication analysis shows that approximately 66% of the assembled contigs are duplicated, which supports a past segmental genome duplication and is further supported by the moderate level of heterozygosity of the assembly. Although these characteristics add to the complexity of the genome, the assembly maintains high completeness and contiguity. It provides a valuable resource for future genetic studies and serves as a critical framework for conservation, monitoring, and restoration of D. antillarum populations across the Caribbean.
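
For readers unfamiliar with the N50 statistic quoted in the abstract, the following is a minimal sketch of the usual definition: the length of the sequence at which the running total, taken from longest to shortest, first reaches half of the total assembly length. The function name `n50` and the toy lengths are illustrative assumptions; they are not taken from the preprint and do not reproduce the authors' pipeline.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Sequences are visited from longest to shortest; the N50 is the length
    of the sequence at which the running total first reaches half of the
    summed length.
    """
    if not lengths:
        raise ValueError("at least one sequence length is required")
    half_total = sum(lengths) / 2
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running >= half_total:
            return length


# Toy scaffold lengths (hypothetical, not the D. antillarum assembly).
toy_lengths = [2, 2, 2, 3, 3, 4, 8, 8]
print(n50(toy_lengths))  # 8
```

With the toy input the total length is 32, so the threshold is 16, which the two longest sequences (8 + 8) reach exactly.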

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

Rank. Journal; papers in training set; percentile; predicted probability

1. G3 Genes|Genomes|Genetics; 351; Top 0.1%; 14.3%
2. Scientific Reports; 3102; Top 5%; 10.4%
3. Molecular Ecology Resources; 161; Top 0.1%; 10.1%
4. G3: Genes, Genomes, Genetics; 222; Top 0.1%; 6.3%
5. Gigabyte; 60; Top 0.2%; 4.3%
6. Molecular Ecology; 304; Top 1%; 4.3%
7. PLOS ONE; 4510; Top 34%; 4.2%
(50% of probability mass above)
8. Frontiers in Genetics; 197; Top 2%; 4.0%
9. Genome Biology and Evolution; 280; Top 0.4%; 4.0%
10. Journal of Heredity; 35; Top 0.1%; 2.6%
11. BMC Genomics; 328; Top 2%; 2.1%
12. Frontiers in Ecology and Evolution; 60; Top 2%; 1.9%
13. Frontiers in Marine Science; 55; Top 0.6%; 1.7%
14. Open Biology; 95; Top 0.8%; 1.5%
15. BMC Biology; 248; Top 2%; 1.5%
16. PeerJ; 261; Top 8%; 1.5%
17. Nature Communications; 4913; Top 57%; 1.2%
18. Current Biology; 596; Top 11%; 1.2%
19. Communications Biology; 886; Top 19%; 0.9%
20. PLOS Genetics; 756; Top 13%; 0.9%
21. Genomics, Proteomics & Bioinformatics; 171; Top 5%; 0.9%
22. Ecology and Evolution; 232; Top 4%; 0.8%
23. Genes; 126; Top 3%; 0.7%
24. Scientific Data; 174; Top 2%; 0.7%
25. Peer Community Journal; 254; Top 4%; 0.6%
26. Proceedings of the National Academy of Sciences; 2130; Top 48%; 0.6%
27. Journal of Genetics and Genomics; 36; Top 3%; 0.6%
28. Insect Molecular Biology; 19; Top 0.2%; 0.6%
29. BMC Ecology and Evolution; 49; Top 2%; 0.6%
30. Genomics; 60; Top 3%; 0.6%
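
The statement that the top 7 journals account for 50% of the predicted probability mass can be checked with a running sum over the ranked percentages. The sketch below assumes the final percentage in each entry is the predicted probability; the function name `journals_to_reach` is a hypothetical helper, not part of the site.

```python
def journals_to_reach(probabilities, threshold=50.0):
    """Count how many top-ranked entries are needed for the running
    total of probabilities (in percent) to reach the threshold.

    Returns None if the threshold is never reached.
    """
    total = 0.0
    for count, probability in enumerate(probabilities, start=1):
        total += probability
        if total >= threshold:
            return count
    return None


# Predicted probabilities (percent) for ranks 1-10, copied from the listing.
top_probabilities = [14.3, 10.4, 10.1, 6.3, 4.3, 4.3, 4.2, 4.0, 4.0, 2.6]
print(journals_to_reach(top_probabilities))  # 7
```

The first six values sum to 49.7%, just short of the threshold, and the seventh brings the total to 53.9%.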