Back

A Fractal-Dimension Framework for Quantifying Self-Similarity in Chromatin Folding

El-Yaagoubi, A.; Balubaid, A. O.; Chung, M. K.; tegner, j.; Ombao, H.

2026-05-09 bioinformatics
10.64898/2026.05.06.723123 bioRxiv
Show abstract

The three-dimensional folding of DNA is essential for genome function, but its organization remains difficult to summarize quantitatively across genomic scales. Here, we study DNA folding from Hi-C contact data using a network-based notion of fractal dimension. In this representation, genomic loci are treated as nodes, and observed Hi-C contacts define weighted edges, so that frequently interacting loci are closer in the resulting network. We then estimate fractal dimension using two complementary graph-based methods: the correlation dimension and the sandbox dimension. Validation on synthetic networks shows that the proposed estimators detect clear scaling behavior in hierarchical fractal-like networks, while distinguishing them from networks with local clustering but no stable multiscale self-similarity. Applied to intrachromosomal Hi-C data from the IMR90 human cell line, the method reveals approximate linear scaling regimes on log-log plots, suggesting fractal-like organization in chromatin contact networks. At the chromosome level, estimated fractal dimension tends to increase with chromosome size: larger chromosomes often have dimensions closer to 3, consistent with more compact and space-filling organization, whereas shorter chromosomes tend to have lower dimensions, closer to 1, consistent with simpler and more open folding patterns. A sliding-window analysis at 5 kb resolution further shows that fractal organization varies substantially along chromosomes rather than remaining uniform across genomic position. These results suggest that graph-based fractal dimension provides an interpretable summary of DNA folding complexity at both global and local scales. More broadly, the proposed framework offers a quantitative way to study multiscale genome organization from Hi-C data using tools from network geometry.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Nucleic Acids Research
1128 papers in training set
Top 0.2%
25.7%
2
PLOS Computational Biology
1633 papers in training set
Top 1%
17.4%
3
Bioinformatics
1061 papers in training set
Top 3%
10.0%
50% of probability mass above
4
Scientific Reports
3102 papers in training set
Top 18%
6.3%
5
NAR Genomics and Bioinformatics
214 papers in training set
Top 0.4%
4.3%
6
BMC Bioinformatics
383 papers in training set
Top 2%
3.8%
7
Nature Communications
4913 papers in training set
Top 41%
3.6%
8
Computational and Structural Biotechnology Journal
216 papers in training set
Top 4%
1.9%
9
Briefings in Bioinformatics
326 papers in training set
Top 4%
1.8%
10
Cell Systems
167 papers in training set
Top 7%
1.8%
11
Frontiers in Genetics
197 papers in training set
Top 5%
1.7%
12
Communications Biology
886 papers in training set
Top 10%
1.7%
13
Advanced Science
249 papers in training set
Top 13%
1.5%
14
Genome Biology
555 papers in training set
Top 5%
1.5%
15
Genome Research
409 papers in training set
Top 3%
1.3%
16
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 38%
1.2%
17
iScience
1063 papers in training set
Top 27%
0.9%
18
Journal of Molecular Biology
217 papers in training set
Top 3%
0.9%
19
Nano Letters
63 papers in training set
Top 3%
0.8%
20
Physical Review E
95 papers in training set
Top 1%
0.8%
21
eLife
5422 papers in training set
Top 58%
0.7%
22
PLOS ONE
4510 papers in training set
Top 68%
0.7%
23
Biophysical Journal
545 papers in training set
Top 6%
0.6%