Functional CENP-B boxes are selectively conserved at centromeric regions

Jiao, C.; Goncharov, N.; Shmakova, A.; Chakraborty, C.; Fachinetti, D.

2026-05-26 genomics
10.64898/2026.05.25.727640 bioRxiv

Human centromeres are built over large stretches of repetitive, divergent α-satellite DNA. Within these sequences lies a conserved, defined 17-bp sequence named the CENP-B box that is bound by the DNA-binding protein CENP-B. Recent studies have proposed that CENP-B box motifs along chromosome arms exist and may represent conserved, ectopic binding sites with functional relevance. Here, we evaluate the genomic distribution, conservation, and binding capacity of different CENP-B box motifs outside canonical centromeres. Analysis of thousands of complete human centromere assemblies reveals exceptional conservation of a unique and canonical CENP-B box motif within centromeres. This is in contrast with the high sequence variability and stochastic occurrence of more degenerate CENP-B box motifs along chromosome arms. Consistently, CENP-B binds only at canonical CENP-B box motifs embedded within α-satellite sites, with no evidence of functional binding at any ectopic sites. Together, these results indicate an adaptive selection of canonical CENP-B boxes within centromeric regions, in contrast to random sequence occurrences for ectopic CENP-B box-like motifs.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1 | Nucleic Acids Research | 1128 papers in training set | Top 0.9% | 14.5%
2 | Nature Communications | 4913 papers in training set | Top 19% | 9.9%
3 | Proceedings of the National Academy of Sciences | 2130 papers in training set | Top 12% | 6.3%
4 | Genome Biology | 555 papers in training set | Top 1% | 6.2%
5 | Nature Genetics | 240 papers in training set | Top 2% | 4.8%
6 | Cell Reports | 1338 papers in training set | Top 16% | 3.5%
7 | Genome Research | 409 papers in training set | Top 1% | 3.5%
8 | Genetics | 225 papers in training set | Top 1% | 3.5%
50% of probability mass above
9 | eLife | 5422 papers in training set | Top 27% | 3.5%
10 | Scientific Reports | 3102 papers in training set | Top 45% | 2.6%
11 | The EMBO Journal | 267 papers in training set | Top 1% | 2.0%
12 | Science | 429 papers in training set | Top 13% | 2.0%
13 | EMBO reports | 136 papers in training set | Top 2% | 2.0%
14 | Nature Structural & Molecular Biology | 218 papers in training set | Top 3% | 1.9%
15 | Philosophical Transactions of the Royal Society B | 51 papers in training set | Top 3% | 1.9%
16 | Life Science Alliance | 263 papers in training set | Top 0.4% | 1.7%
17 | Molecular Cell | 308 papers in training set | Top 7% | 1.7%
18 | PLOS Biology | 408 papers in training set | Top 11% | 1.6%
19 | Communications Biology | 886 papers in training set | Top 11% | 1.5%
20 | Current Biology | 596 papers in training set | Top 11% | 1.3%
21 | Chromosome Research | 18 papers in training set | Top 0.1% | 1.2%
22 | The Plant Journal | 197 papers in training set | Top 3% | 1.2%
23 | PLOS Genetics | 756 papers in training set | Top 12% | 1.1%
24 | Science Advances | 1098 papers in training set | Top 25% | 1.1%
25 | EMBO Reports | 88 papers in training set | Top 0.4% | 0.9%
26 | NAR Genomics and Bioinformatics | 214 papers in training set | Top 3% | 0.9%
27 | Nature Biotechnology | 147 papers in training set | Top 7% | 0.9%
28 | Genome Biology and Evolution | 280 papers in training set | Top 2% | 0.9%
29 | Molecular Biology and Evolution | 488 papers in training set | Top 4% | 0.8%
30 | The American Journal of Human Genetics | 206 papers in training set | Top 4% | 0.7%
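
The statement that the top 8 journals account for 50% of the predicted probability mass can be checked from the listed percentages. The following is a minimal sketch, assuming the final percentage in each entry is that journal's predicted probability; the function name `ranks_to_reach` is hypothetical and not part of the listing.

```python
# Predicted probabilities (percent) for ranks 1-30, copied from the list above.
# Assumption: the last percentage in each entry is the predicted probability.
probs = [
    14.5, 9.9, 6.3, 6.2, 4.8, 3.5, 3.5, 3.5, 3.5, 2.6,
    2.0, 2.0, 2.0, 1.9, 1.9, 1.7, 1.7, 1.6, 1.5, 1.3,
    1.2, 1.2, 1.1, 1.1, 0.9, 0.9, 0.9, 0.9, 0.8, 0.7,
]


def ranks_to_reach(values, threshold):
    """Return (rank, cumulative total) at the first rank where the running
    sum reaches the threshold, or None if it is never reached."""
    total = 0.0
    for rank, value in enumerate(values, start=1):
        total += value
        if total >= threshold:
            return rank, round(total, 1)
    return None


result = ranks_to_reach(probs, 50.0)
print(result)
```

With these values the running sum is 48.7% after rank 7 and passes 50% at rank 8, which agrees with where the listing places its cutoff.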