Telomere-to-telomere, accurate, and gapless genome assembly (TTAGGA) of the Korean Jindo dog with a single-contig Y chromosome
Choi, H.; Kim, J.-S.; Kwon, Y.; Park, S.; Jeon, S.; Bhak, J.; Shin, D.; Choi, Y.; An, K.; Ryu, D.-Y.; Paek, W. K.; Park, D.; Kim, J.; Sinding, M.-H. S.; Choe, Y.; Hyun, B.-R.; Lee, S.-k.; Bhak, J.
Show abstract
Complete canine reference genomes are essential for studies of structural variation, sex-chromosome evolution, and breed-specific architecture, yet existing references remain incomplete on the Y chromosome and retain internal gaps across multiple chromosomes. Here, we introduce TTAGGA (Telomere-to-Telomere, Accurate, Gapless Genome Assembly), a stricter completeness standard requiring chromosomes assembled end-to-end with zero internal gaps and Merqury consensus QV [≥] 50, and present Jindo1-G-TTAGGA, the first canine assembly to meet this standard, achieved under a stringent QV [≥] 60 threshold. From 813 Gb ([~]340x coverage) of multi-platform data (PacBio HiFi [~]150x, ONT ultra-long [~]103x, parental Illumina [~]40x per parent) generated from a male Korean Jindo dog, trio binning with hifiasm produced two haplotype-resolved assemblies of 2,441.6 Mb (Hap1, maternal, chrX-carrying) and 2,340.5 Mb (Hap2, paternal, chrY-carrying). Both haplotypes are gap-free at every internal position, with canonical telomeric repeats verified at both ends of all 39 chromosomes (tidk), Merqury consensus QV of 78.0 (Hap1) and 76.8 (Hap2), trio switch-error rates below 0.13%, BUSCO completeness of 99.3% (Hap1) and 96.4% (Hap2; the lower value reflects absent X-linked orthologues in the Y-bearing haplotype), and Genome Continuity Index values of 98.2 (Hap1) and 94.7 (Hap2). Hap2 carries a single 21,255,890 bp gap-free Y chromosome with TTAGGG telomeric repeats at the q-arm terminus and an acrocentric satellite-rich p-arm, representing a 5.4-fold increase over the 3.94 Mb chrY of ROS_Cfam_1.0 and adding approximately 14 Mb of newly resolved Y-linked sequence; this corresponds to roughly 79% of the cytogenetically estimated 27 Mb full-length canine chrY. Jindo1-G-TTAGGA provides a chromosome-scale, haplotype-resolved, gap-free canine reference for studies of canine structural variation, sex-chromosome evolution, and canid phylogenomics.
Matching journals
The top 7 journals account for 50% of the predicted probability mass.