Complete Telomere-to-Telomere Assembly of the Y Chromosome in the Chinese Quartet

Wang, B.; Wan, S.; Zhang, P.; Zhang, Y.; Wang, X.; Dong, L.; Ye, K.; Yang, X.

2026-04-16 genomics
10.64898/2026.04.13.718326 bioRxiv
The complete assembly of the human Y chromosome remains a challenge due to its highly repetitive and complex structure. While complete telomere-to-telomere (T2T) assemblies have been generated for a few individuals, such high-quality resources for East Asian populations, particularly for well-characterized multi-omics reference cohorts, are still scarce. The Chinese Quartet, comprising monozygotic twin daughters and their parents, is a premier reference material for genomic studies, yet a T2T-level Y chromosome assembly for this pedigree was lacking. Here, we present a complete, gapless T2T assembly of the Y chromosome (designated CQ-chrY) from the father of the Chinese Quartet. This assembly was generated by integrating Oxford Nanopore ultra-long reads, PacBio HiFi reads, and Hi-C data, resulting in a sequence of 61.88 Mb. The assembly shows exceptional base accuracy (QV = 51.09) and structural completeness (GCI = 100; CRAQ AQI = 95.217). We completely resolved the 33.52 Mb Yq12 heterochromatic region and annotated 164 protein-coding genes and 51.03 Mb (82.47%) of repetitive sequences. This CQ-chrY assembly represents the third complete Chinese Y chromosome and fills the last gap in the T2T assemblies of the Quartet family, providing an invaluable paternal haplotype resource for expanding East Asian genomic standards and for studies on Y chromosome structural variation and evolution.
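
As a quick sanity check on the figures quoted in the abstract, the sketch below converts the reported QV into a per-base error rate and recomputes the repeat fraction. It assumes QV is the usual Phred-scaled consensus quality, QV = -10 * log10(error rate); the abstract does not state how QV was computed, and the function and variable names are illustrative only.

```python
import math


def qv_to_error_rate(qv: float) -> float:
    """Convert a Phred-scaled quality value to a per-base error probability.

    Assumption: QV = -10 * log10(p), the common Phred convention.
    """
    return 10 ** (-qv / 10)


def error_rate_to_qv(p: float) -> float:
    """Inverse of qv_to_error_rate under the same Phred assumption."""
    return -10 * math.log10(p)


# Values quoted in the abstract.
reported_qv = 51.09
assembly_mb = 61.88
repeat_mb = 51.03

error_rate = qv_to_error_rate(reported_qv)
bases_per_error = 1 / error_rate
repeat_pct = 100 * repeat_mb / assembly_mb

print(f"error rate per base: {error_rate:.3e}")
print(f"about one error every {bases_per_error:,.0f} bases")
print(f"repeat fraction: {repeat_pct:.2f}%")
```

Under the Phred assumption, a QV of 51.09 corresponds to on the order of one error per 130,000 bases, and 51.03 Mb of a 61.88 Mb sequence matches the 82.47% repeat content stated in the abstract.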

Matching journals

The top 5 journals together account for just over 50% (about 52.7%) of the predicted probability mass.

1. Genomics, Proteomics & Bioinformatics: 171 papers in training set; Top 0.3%; 16.9%
2. Nature Communications: 4913 papers in training set; Top 12%; 13.9%
3. Cell Genomics: 162 papers in training set; Top 0.2%; 9.7%
4. Genome Biology: 555 papers in training set; Top 1%; 6.1%
5. Science: 429 papers in training set; Top 6%; 6.1%

50% of probability mass above

6. Nucleic Acids Research: 1128 papers in training set; Top 4%; 4.7%
7. Genome Medicine: 154 papers in training set; Top 2%; 4.2%
8. Journal of Genetics and Genomics: 36 papers in training set; Top 0.6%; 2.4%
9. Genome Research: 409 papers in training set; Top 2%; 2.3%
10. The American Journal of Human Genetics: 206 papers in training set; Top 2%; 2.0%
11. Communications Biology: 886 papers in training set; Top 6%; 2.0%
12. Cell: 370 papers in training set; Top 11%; 1.7%
13. Nature: 575 papers in training set; Top 11%; 1.6%
14. Scientific Reports: 3102 papers in training set; Top 61%; 1.6%
15. National Science Review: 22 papers in training set; Top 1%; 1.6%
16. eLife: 5422 papers in training set; Top 46%; 1.4%
17. Cell Discovery: 54 papers in training set; Top 3%; 1.3%
18. Nature Genetics: 240 papers in training set; Top 5%; 1.3%
19. Molecular Plant: 36 papers in training set; Top 1.0%; 1.3%
20. Human Molecular Genetics: 130 papers in training set; Top 2%; 1.3%
21. Frontiers in Genetics: 197 papers in training set; Top 8%; 0.9%
22. DNA Research: 23 papers in training set; Top 0.5%; 0.8%
23. Protein & Cell: 25 papers in training set; Top 3%; 0.7%
24. Advanced Science: 249 papers in training set; Top 21%; 0.7%
25. The Plant Journal: 197 papers in training set; Top 3%; 0.7%
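
The "50% of probability mass" cutoff in the list above can be reproduced with a running sum. The sketch below is one plausible reading, assuming the cutoff marks the smallest number of top-ranked journals whose probabilities reach 50%; the listing does not say how it was computed, and the helper name is hypothetical.

```python
from itertools import accumulate

# Predicted probabilities (percent) for ranks 1-25, copied from the listing.
probabilities = [
    16.9, 13.9, 9.7, 6.1, 6.1,
    4.7, 4.2, 2.4, 2.3, 2.0,
    2.0, 1.7, 1.6, 1.6, 1.6,
    1.4, 1.3, 1.3, 1.3, 1.3,
    0.9, 0.8, 0.7, 0.7, 0.7,
]


def smallest_prefix_reaching(values, threshold):
    """Return (k, cumulative) for the first rank k whose running sum
    reaches the threshold, or None if the threshold is never reached."""
    for k, total in enumerate(accumulate(values), start=1):
        if total >= threshold:
            return k, total
    return None


top_k, mass_at_k = smallest_prefix_reaching(probabilities, 50.0)
listed_mass = sum(probabilities)

print(f"top {top_k} journals cover {mass_at_k:.1f}%")
print(f"all {len(probabilities)} listed journals cover {listed_mass:.1f}%")
```

With the rounded values shown, the first four journals sum to 46.6% and the fifth brings the total to 52.7%, which is consistent with the cutoff falling after rank 5. The 25 listed journals sum to about 87.2%; the listing does not show where the remainder falls.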