Back

Towards a database capturing chromosome structure and function: symbols and syntax

Cook, P. R.; Marenduzzo, D.; Valei, Z.

2026-05-14 biophysics
10.64898/2026.05.14.724942 bioRxiv
Show abstract

Existing databases of interphase chromosome conformations typically store three-dimensional coordinates of genomic segments. However, since interphase chromatin is highly dynamic, such databases are dominated by transient configurations and unstructured regions, whose positions vary continuously between cells and over time, unlike folded proteins such as globin, which adopt similar structures in every cell. These drawbacks motivated the inception of a database based on strion (a portmanteau of a string capturing structure and function). A strion concisely describes the structure and activity of all transcription units in one cell, by retaining only functionally relevant positional information. Sets of strions describing structures in different cells sampled at different times are compiled into a super-strion. Then, 46 super-strions summarise the range of structure and activity of a human cell type, including information on all transcription units, how often each co-fires and co-clusters with others in transcription factories/hubs, enhancer interactomes and small-world expression networks. Graphical abstract O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=200 SRC="FIGDIR/small/724942v1_ufig1.gif" ALT="Figure 1"> View larger version (38K): org.highwire.dtl.DTLVardef@13a1263org.highwire.dtl.DTLVardef@18d2c78org.highwire.dtl.DTLVardef@162865corg.highwire.dtl.DTLVardef@1631d65_HPS_FORMAT_FIGEXP M_FIG C_FIG

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Nucleic Acids Research
1128 papers in training set
Top 0.6%
18.0%
2
iScience
1063 papers in training set
Top 0.2%
12.0%
3
Nucleus
11 papers in training set
Top 0.1%
6.2%
4
Nature Communications
4913 papers in training set
Top 31%
6.2%
5
PLOS Computational Biology
1633 papers in training set
Top 6%
6.1%
6
Scientific Reports
3102 papers in training set
Top 29%
4.2%
50% of probability mass above
7
eLife
5422 papers in training set
Top 23%
3.8%
8
Scientific Data
174 papers in training set
Top 0.5%
3.6%
9
Journal of Molecular Biology
217 papers in training set
Top 0.7%
3.5%
10
Genome Biology
555 papers in training set
Top 3%
2.6%
11
Advanced Science
249 papers in training set
Top 8%
2.5%
12
Computational and Structural Biotechnology Journal
216 papers in training set
Top 3%
2.5%
13
Communications Biology
886 papers in training set
Top 4%
2.3%
14
Bioinformatics
1061 papers in training set
Top 7%
1.6%
15
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 36%
1.4%
16
Nature Methods
336 papers in training set
Top 5%
1.4%
17
Life Science Alliance
263 papers in training set
Top 0.6%
1.3%
18
Cells
232 papers in training set
Top 4%
1.2%
19
Nature Structural & Molecular Biology
218 papers in training set
Top 4%
1.2%
20
Genome Research
409 papers in training set
Top 3%
1.2%
21
Cell Systems
167 papers in training set
Top 10%
1.1%
22
Database
51 papers in training set
Top 0.7%
0.9%
23
Cell Reports
1338 papers in training set
Top 33%
0.8%
24
Nature
575 papers in training set
Top 15%
0.8%
25
International Journal of Molecular Sciences
453 papers in training set
Top 14%
0.8%
26
PLOS ONE
4510 papers in training set
Top 69%
0.7%