
Predicting 3D Chromatin Interactions Using Transformer-Enhanced Deep Learning Models

Xu, K.; Shen, L.

2025-04-16 bioinformatics
10.1101/2025.04.10.647995 bioRxiv

The three-dimensional (3D) structure of the human genome is essential for regulating gene expression and cellular functions. Chromatin interactions bring distant genomic regions into physical contact, enabling processes such as gene regulation, DNA replication, and repair. Disruptions in this organization can lead to diseases such as cancer and genetic disorders. In this study, we propose a Transformer-based deep learning model to predict chromatin interactions from DNA sequences. By developing a streamlined and efficient data pipeline to handle sparse and noisy high-throughput chromosome conformation capture (Hi-C) sequencing data, our approach improves both data processing speed and model performance. The Transformer's ability to capture long-range interactions among genomic regions via the attention mechanism, combined with nucleotide position encoding, enables more accurate predictions than purely convolution-based models. This work highlights the potential of Transformer-based network architectures to advance our understanding of genome organization and paves the way for future research with large datasets and advanced network designs.
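
The combination of attention and position encoding described in the abstract can be illustrated with a minimal sketch. It is not the authors' model: the one-hot encoding, the sinusoidal position scheme, the single attention head, the random weights, and every function name here (`one_hot_dna`, `sinusoidal_positions`, `self_attention`) are assumptions chosen only to show how each nucleotide position can attend to every other position, however distant.

```python
import numpy as np

def one_hot_dna(seq):
    """One-hot encode a DNA string as a (length, 4) array; unknown bases such as N stay all-zero."""
    index = {"A": 0, "C": 1, "G": 2, "T": 3}
    encoded = np.zeros((len(seq), 4))
    for i, base in enumerate(seq.upper()):
        if base in index:
            encoded[i, index[base]] = 1.0
    return encoded

def sinusoidal_positions(length, dim):
    """Sinusoidal position encoding of shape (length, dim); dim is assumed to be even."""
    positions = np.arange(length)[:, None]
    freqs = np.exp(-np.log(10000.0) * np.arange(0, dim, 2) / dim)
    enc = np.zeros((length, dim))
    enc[:, 0::2] = np.sin(positions * freqs)
    enc[:, 1::2] = np.cos(positions * freqs)
    return enc

def self_attention(x, w_q, w_k, w_v):
    """Single-head scaled dot-product self-attention; returns outputs and attention weights."""
    q, k, v = x @ w_q, x @ w_k, x @ w_v
    scores = q @ k.T / np.sqrt(k.shape[-1])
    scores -= scores.max(axis=-1, keepdims=True)  # subtract row max for numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over positions
    return weights @ v, weights

# Hypothetical toy input: a 12-nucleotide sequence and random (untrained) weights.
rng = np.random.default_rng(0)
seq = "ACGTTGCAAGCT"
dim = 8
w_embed = rng.normal(size=(4, dim))
x = one_hot_dna(seq) @ w_embed + sinusoidal_positions(len(seq), dim)
w_q, w_k, w_v = (rng.normal(size=(dim, dim)) for _ in range(3))
out, attn = self_attention(x, w_q, w_k, w_v)
print(out.shape, attn.shape)
```

Each row of `attn` is a probability distribution over all positions in the sequence, which is what lets an attention layer relate distant regions directly, whereas a convolution only sees a fixed local window per layer.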

Matching journals

The top 8 journals account for 50% of the predicted probability mass.
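
The 50% figure can be checked with a short cumulative sum over the probabilities listed for the top-ranked journals in this section. This is a sketch only: the function name `journals_to_reach` is hypothetical, and because the listed percentages are rounded to one decimal place, the total is approximate.

```python
# Predicted probabilities (in percent) of the ten highest-ranked journals, as listed on this page.
probabilities = [10.0, 8.3, 6.3, 6.3, 6.3, 6.3, 6.3, 4.8, 3.9, 3.6]

def journals_to_reach(probs, threshold=50.0):
    """Return how many top-ranked entries are needed to reach the threshold, and the running total."""
    total = 0.0
    for count, p in enumerate(probs, start=1):
        total += p
        if total >= threshold:
            return count, total
    return None, total  # threshold never reached

count, total = journals_to_reach(probabilities)
print(count, round(total, 1))  # prints: 8 54.6
```

The first seven journals sum to about 49.8%, just short of the threshold, so the eighth is the one that carries the cumulative mass past 50%.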

Rank | Journal | Papers in training set | Percentile | Predicted probability
1 | PLOS Computational Biology | 1633 | Top 3% | 10.0%
2 | Frontiers in Genetics | 197 | Top 0.5% | 8.3%
3 | Bioinformatics | 1061 | Top 4% | 6.3%
4 | Advanced Science | 249 | Top 3% | 6.3%
5 | IEEE Transactions on Computational Biology and Bioinformatics | 17 | Top 0.1% | 6.3%
6 | Briefings in Bioinformatics | 326 | Top 0.9% | 6.3%
7 | Computational and Structural Biotechnology Journal | 216 | Top 0.6% | 6.3%
8 | Communications Biology | 886 | Top 0.7% | 4.8%
50% of probability mass above
9 | IEEE Journal of Biomedical and Health Informatics | 34 | Top 0.4% | 3.9%
10 | Nucleic Acids Research | 1128 | Top 6% | 3.6%
11 | NAR Genomics and Bioinformatics | 214 | Top 1.0% | 2.9%
12 | Nature Communications | 4913 | Top 44% | 2.7%
13 | PLOS ONE | 4510 | Top 49% | 2.1%
14 | Scientific Reports | 3102 | Top 54% | 1.9%
15 | BMC Bioinformatics | 383 | Top 4% | 1.9%
16 | Nature Machine Intelligence | 61 | Top 2% | 1.7%
17 | Genomics, Proteomics & Bioinformatics | 171 | Top 4% | 1.7%
18 | iScience | 1063 | Top 16% | 1.6%
19 | IEEE/ACM Transactions on Computational Biology and Bioinformatics | 32 | Top 0.4% | 1.1%
20 | Bioinformatics Advances | 184 | Top 4% | 0.9%
21 | Quantitative Biology | 11 | Top 0.5% | 0.9%
22 | Genome Research | 409 | Top 3% | 0.9%
23 | Journal of Chemical Information and Modeling | 207 | Top 3% | 0.8%
24 | BMC Genomics | 328 | Top 6% | 0.7%
25 | Journal of Computational Biology | 37 | Top 0.7% | 0.7%
26 | Frontiers in Immunology | 586 | Top 9% | 0.6%