Back

Telomere-to-telomere, accurate, and gapless genome assembly (TTAGGA) of the Korean Jindo dog with a single-contig Y chromosome

Choi, H.; Kim, J.-S.; Kwon, Y.; Park, S.; Jeon, S.; Bhak, J.; Shin, D.; Choi, Y.; An, K.; Ryu, D.-Y.; Paek, W. K.; Park, D.; Kim, J.; Sinding, M.-H. S.; Choe, Y.; Hyun, B.-R.; Lee, S.-k.; Bhak, J.

2026-05-20 genomics
10.64898/2026.05.17.725804 bioRxiv
Show abstract

Complete canine reference genomes are essential for studies of structural variation, sex-chromosome evolution, and breed-specific architecture, yet existing references remain incomplete on the Y chromosome and retain internal gaps across multiple chromosomes. Here, we introduce TTAGGA (Telomere-to-Telomere, Accurate, Gapless Genome Assembly), a stricter completeness standard requiring chromosomes assembled end-to-end with zero internal gaps and Merqury consensus QV [≥] 50, and present Jindo1-G-TTAGGA, the first canine assembly to meet this standard, achieved under a stringent QV [≥] 60 threshold. From 813 Gb ([~]340x coverage) of multi-platform data (PacBio HiFi [~]150x, ONT ultra-long [~]103x, parental Illumina [~]40x per parent) generated from a male Korean Jindo dog, trio binning with hifiasm produced two haplotype-resolved assemblies of 2,441.6 Mb (Hap1, maternal, chrX-carrying) and 2,340.5 Mb (Hap2, paternal, chrY-carrying). Both haplotypes are gap-free at every internal position, with canonical telomeric repeats verified at both ends of all 39 chromosomes (tidk), Merqury consensus QV of 78.0 (Hap1) and 76.8 (Hap2), trio switch-error rates below 0.13%, BUSCO completeness of 99.3% (Hap1) and 96.4% (Hap2; the lower value reflects absent X-linked orthologues in the Y-bearing haplotype), and Genome Continuity Index values of 98.2 (Hap1) and 94.7 (Hap2). Hap2 carries a single 21,255,890 bp gap-free Y chromosome with TTAGGG telomeric repeats at the q-arm terminus and an acrocentric satellite-rich p-arm, representing a 5.4-fold increase over the 3.94 Mb chrY of ROS_Cfam_1.0 and adding approximately 14 Mb of newly resolved Y-linked sequence; this corresponds to roughly 79% of the cytogenetically estimated 27 Mb full-length canine chrY. Jindo1-G-TTAGGA provides a chromosome-scale, haplotype-resolved, gap-free canine reference for studies of canine structural variation, sex-chromosome evolution, and canid phylogenomics.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 6%
18.4%
2
Genome Medicine
154 papers in training set
Top 0.5%
10.0%
3
Science
429 papers in training set
Top 5%
6.3%
4
Scientific Data
174 papers in training set
Top 0.3%
4.8%
5
Emerging Infectious Diseases
103 papers in training set
Top 0.3%
4.8%
6
Nature
575 papers in training set
Top 6%
4.5%
7
Scientific Reports
3102 papers in training set
Top 28%
4.2%
50% of probability mass above
8
PLOS ONE
4510 papers in training set
Top 40%
3.5%
9
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 21%
3.5%
10
The American Journal of Human Genetics
206 papers in training set
Top 2%
2.3%
11
Genome Research
409 papers in training set
Top 2%
2.1%
12
Genome Biology
555 papers in training set
Top 4%
2.1%
13
Communications Biology
886 papers in training set
Top 7%
1.9%
14
Nucleic Acids Research
1128 papers in training set
Top 10%
1.9%
15
Cell
370 papers in training set
Top 12%
1.6%
16
Peer Community Journal
254 papers in training set
Top 2%
1.5%
17
Frontiers in Genetics
197 papers in training set
Top 6%
1.3%
18
eLife
5422 papers in training set
Top 48%
1.3%
19
Nature Genetics
240 papers in training set
Top 6%
1.2%
20
Nature Medicine
117 papers in training set
Top 4%
0.9%
21
Nature Ecology & Evolution
113 papers in training set
Top 4%
0.9%
22
BMC Genomics
328 papers in training set
Top 5%
0.9%
23
PLOS Genetics
756 papers in training set
Top 13%
0.9%
24
Science Advances
1098 papers in training set
Top 29%
0.8%
25
Bioinformatics Advances
184 papers in training set
Top 4%
0.8%
26
Genes
126 papers in training set
Top 3%
0.8%
27
Nature Methods
336 papers in training set
Top 6%
0.7%
28
Cell Reports
1338 papers in training set
Top 34%
0.7%
29
PLOS Medicine
98 papers in training set
Top 5%
0.7%
30
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 6%
0.7%