Back

Genome assembly and diagnostic DNA markers for sex of the largest freshwater fish in North America, the white sturgeon (Acipenser transmontanus).

Willis, S. C.; Smith, J.; Narum, S. R.

2025-11-17 genomics
10.1101/2025.11.14.688503 bioRxiv
Show abstract

Sturgeon and paddlefish represent some of the most early diverging branches of ray-finned fishes and have undergone at least one global and several lineage-specific whole genome duplication events. White sturgeon (Acipenser transmontanus), the largest freshwater fish in North America, have experienced at least two rounds of whole genome duplication, and may exhibit both di- and multi-valent meiotic segregation. Moreover, they exhibit contemporary ploidy variants due to spontaneous autopolyploidy, particularly in aquaculture. Nonetheless, as a species with several population segments that exhibit chronic recruitment failure, conservation aquaculture is an important part of their management. To facilitate the development of genetic tools to aid white sturgeon conservation, as well as a basis to understand how ploidy changes and variation historically and contemporarily shape the evolution of this species, we present a genome assembly for a white sturgeon from the Snake River, Idaho, USA. Analysis of sequence data used for assembly indicated a haploid genome size of approximately 1.5Gbp, implying tetraploidy (4N), while analysis of heterozygous k-mers from 21 to 41 bp suggest the genome reflects both 4N and 8N variants. The final genome assembly, scaffolded using linkage maps constructed from Fraser River and Snake River F1 families, contained 6.26Gbp in 832,145 scaffolds, consistent with published genome size estimates. Conserved ortholog completeness for this genome (90.5%; 22.8% single-copy and 67.7% duplicated) was similar to the putatively diploid sterlet sturgeon, and the largest linkage map-based scaffold was 55.2Gbp, though the N50 for this assembly was only 416Kbp, indicating the assembly remains fragmented. We demonstrate the utility of this assembly by identifying genomic regions significantly associated with sex. Genetic markers, designed for inclusion in an amplicon genotyping panel, predicted sex 96.6% and 81.5% correctly in females and males, respectively, providing a strong overall association ({square}2 p-value < 2.7x10-37) with some variation by geographic region. Article summaryFunctional polysomes (more than two chromosomes that pair in meiosis) are rare among vertebrates. White sturgeon, the largest freshwater fish in North America, are tetraploid and occasionally hexaploid. Several populations of this species are stagnant or declining, requiring aquaculture to bolster reproduction. We analyzed whole genomic data and assembled the genome of an individual from the unique Snake River, Idaho, population. Results indicate most genetic variants are consistent with tetraploidy, and the genome assembly, while fragmented, is largely complete. We demonstrate its utility by designing genetic markers for sex for use in conservation and commercial aquaculture.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Aquaculture
29 papers in training set
Top 0.1%
18.4%
2
Journal of Heredity
35 papers in training set
Top 0.1%
14.5%
3
Gigabyte
60 papers in training set
Top 0.1%
8.3%
4
G3: Genes, Genomes, Genetics
222 papers in training set
Top 0.1%
6.2%
5
Evolutionary Applications
91 papers in training set
Top 0.2%
4.8%
50% of probability mass above
6
PLOS ONE
4510 papers in training set
Top 34%
4.3%
7
G3 Genes|Genomes|Genetics
351 papers in training set
Top 0.5%
4.1%
8
Frontiers in Marine Science
55 papers in training set
Top 0.3%
4.1%
9
Molecular Ecology
304 papers in training set
Top 2%
3.0%
10
Molecular Ecology Resources
161 papers in training set
Top 0.4%
2.7%
11
Scientific Reports
3102 papers in training set
Top 48%
2.3%
12
Ecology and Evolution
232 papers in training set
Top 2%
1.9%
13
PeerJ
261 papers in training set
Top 7%
1.7%
14
BMC Genomics
328 papers in training set
Top 3%
1.5%
15
Frontiers in Genetics
197 papers in training set
Top 6%
1.3%
16
Genes
126 papers in training set
Top 2%
1.3%
17
Journal of Fish Biology
14 papers in training set
Top 0.2%
1.2%
18
Developmental Dynamics
50 papers in training set
Top 0.7%
0.8%
19
Evolution & Development
18 papers in training set
Top 0.2%
0.7%
20
Genome Biology and Evolution
280 papers in training set
Top 2%
0.7%
21
Journal of Evolutionary Biology
98 papers in training set
Top 1%
0.7%
22
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 7%
0.6%
23
DNA Research
23 papers in training set
Top 0.6%
0.6%