Back

PanACRpred: Predicting Accessible Chromatin Regions in Pangenomes using Motif Chaining

Warr, M. J.; Dinh, T.; Root, B.; Onstott, E.; Yu, K.; Mudge, J.; Ramaraj, T.; Kahanda, I.; Mumey, B.

2026-02-06 bioinformatics
10.64898/2026.02.05.703812 bioRxiv
Show abstract

In this work, we investigate using motif subsequence features to predict whether a genomic region is accessible to regulatory proteins, i.e. an accessible chromatin region (ACR), enabling transcription of associated genes. We focus on plants, whose agricultural and ecological importance make them interesting and important organisms to study, and whose complex genomes provide important stress tests for our algorithm. We show that motif sequence similarity as found by co-linear chaining can be used in combination with machine learning models to effectively predict ACRs in genome assemblies.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
BMC Bioinformatics
383 papers in training set
Top 0.2%
21.9%
2
PLOS Computational Biology
1633 papers in training set
Top 3%
12.0%
3
NAR Genomics and Bioinformatics
214 papers in training set
Top 0.1%
9.8%
4
Bioinformatics
1061 papers in training set
Top 4%
7.0%
50% of probability mass above
5
Nucleic Acids Research
1128 papers in training set
Top 4%
4.7%
6
Computational and Structural Biotechnology Journal
216 papers in training set
Top 1%
3.9%
7
PLOS ONE
4510 papers in training set
Top 41%
3.5%
8
Genome Biology
555 papers in training set
Top 3%
3.5%
9
Briefings in Bioinformatics
326 papers in training set
Top 2%
3.0%
10
Frontiers in Genetics
197 papers in training set
Top 3%
3.0%
11
BMC Genomics
328 papers in training set
Top 1%
2.4%
12
Genes
126 papers in training set
Top 1%
1.6%
13
Bioinformatics Advances
184 papers in training set
Top 3%
1.6%
14
Genomics
60 papers in training set
Top 2%
1.2%
15
GigaScience
172 papers in training set
Top 2%
1.2%
16
Scientific Reports
3102 papers in training set
Top 70%
0.9%
17
in silico Plants
24 papers in training set
Top 0.2%
0.9%
18
iScience
1063 papers in training set
Top 25%
0.9%
19
Epigenetics & Chromatin
42 papers in training set
Top 0.3%
0.8%
20
Journal of Bioinformatics and Systems Biology
14 papers in training set
Top 0.6%
0.8%
21
Nature Communications
4913 papers in training set
Top 62%
0.8%
22
Frontiers in Bioinformatics
45 papers in training set
Top 1%
0.7%
23
Cell Systems
167 papers in training set
Top 13%
0.7%
24
Biochimica et Biophysica Acta (BBA) - Gene Regulatory Mechanisms
14 papers in training set
Top 0.2%
0.7%
25
Royal Society Open Science
193 papers in training set
Top 6%
0.6%
26
Heliyon
146 papers in training set
Top 8%
0.6%