Back

Chromosome-level genome construction of a Japanese stickleback species using ultra-dense linkage analysis using single-cell sequencing of sperms

Yoshitake, K.; Ishikawa, A.; Yonezawa, R.; Kinoshita, S.; Kitano, J.; Asakawa, S.

2020-05-14 genomics
10.1101/2020.05.12.092221 bioRxiv
Show abstract

The presence of high quality genomes at the chromosome level is very useful in the search for the causal genes of mutants and in genetic breeding. The advent of next-generation sequencers has made it easier to decode genomes, but it is still difficult to construct the genomes of higher organisms. In order to construct the genome of a higher organism, the genome sequence of the organism is extended to the length of the chromosome by linkage analysis after assembly and scaffolding. However, in the past linkage analysis, it was difficult to make a high-density linkage map, and it was not possible to analyze organisms without an established breeding system. As an innovative alternative to conventional linkage analysis, we devised a method for genotyping sperm using 10x single-cell genome (CNV) sequencing libraries to generate a linkage map without interbreeding individuals. The genome was constructed using sperm from Gasterosteus nipponicus, and single-cell genotyping yielded 1,864,430 very dense hetero-SNPs. The average coverage per sperm cell is 0.13x. The number of sperm used is 1,738, which is an order of magnitude higher than the number of sperm used for conventional linkage analysis. We have improved the linkage analysis tool SELDLA (Scaffold Extender with Low Depth Linkage Analysis) so that we can analyze the data in accordance with the characteristics of single-cell genotyping data. Finally, we were able to determine the location and orientation on the chromosome for 85.6% of the contigs in the 456 Mbase genome of Gasterosteus nipponicus sequenced in nanopores. A total of 95.6% of the contigs in which a cross-reaction was detected within the contigs.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
DNA Research
23 papers in training set
Top 0.1%
18.4%
2
Scientific Reports
3102 papers in training set
Top 5%
10.5%
3
BMC Genomics
328 papers in training set
Top 0.2%
7.2%
4
PLOS ONE
4510 papers in training set
Top 27%
6.4%
5
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 2%
3.6%
6
Frontiers in Genetics
197 papers in training set
Top 2%
3.6%
7
Molecular Ecology Resources
161 papers in training set
Top 0.4%
2.7%
50% of probability mass above
8
Genes
126 papers in training set
Top 0.5%
2.6%
9
Journal of Genetics and Genomics
36 papers in training set
Top 0.5%
2.6%
10
Gigabyte
60 papers in training set
Top 0.5%
2.1%
11
F1000Research
79 papers in training set
Top 1%
2.1%
12
eLife
5422 papers in training set
Top 38%
1.9%
13
G3 Genes|Genomes|Genetics
351 papers in training set
Top 1%
1.7%
14
Genome Biology
555 papers in training set
Top 4%
1.7%
15
Database
51 papers in training set
Top 0.4%
1.7%
16
Nature Communications
4913 papers in training set
Top 52%
1.7%
17
GigaScience
172 papers in training set
Top 2%
1.5%
18
Genome Research
409 papers in training set
Top 3%
1.2%
19
Frontiers in Plant Science
240 papers in training set
Top 4%
1.1%
20
BMC Bioinformatics
383 papers in training set
Top 6%
1.0%
21
Briefings in Bioinformatics
326 papers in training set
Top 5%
1.0%
22
Communications Biology
886 papers in training set
Top 17%
1.0%
23
Journal of Bioinformatics and Systems Biology
14 papers in training set
Top 0.5%
0.9%
24
Aquaculture
29 papers in training set
Top 0.6%
0.8%
25
Frontiers in Microbiology
375 papers in training set
Top 8%
0.8%
26
The Plant Journal
197 papers in training set
Top 3%
0.8%
27
Bioinformatics Advances
184 papers in training set
Top 5%
0.8%
28
Archives of Clinical and Biomedical Research
28 papers in training set
Top 2%
0.8%
29
Synthetic and Systems Biotechnology
10 papers in training set
Top 0.6%
0.7%
30
International Journal of Biological Macromolecules
65 papers in training set
Top 4%
0.7%