Back

OpenBase: a universal framework for high-accuracy single-molecule detection of diverse non-canonical DNA bases using nanopore sequencing

Zhong, Z.; Xie, Y.-y.; Luo, R.; Chen, H.; Qiao, Z.; Ren, Z.; Zhang, Z.; Luo, G.

2026-05-16 bioinformatics
10.64898/2026.05.13.724850 bioRxiv
Show abstract

Nanopore sequencing holds great potential for the direct detection of non-canonical DNA bases from electrical signals, yet current approaches remain limited to a few classical epigenetic marks. Here we present OpenBase, an open and universal framework that standardizes training-data generation and deep learning based-modeling for single-molecule identification of diverse non-canonical DNA bases. OpenBase transforms nanopore sequencing into a broadly accessible platform for exploring the chemical diversity of DNA.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 8%
17.3%
2
Advanced Science
249 papers in training set
Top 1%
10.3%
3
Nature Biotechnology
147 papers in training set
Top 0.7%
10.0%
4
ACS Nano
99 papers in training set
Top 0.8%
4.8%
5
Nature Methods
336 papers in training set
Top 2%
4.3%
6
Genome Medicine
154 papers in training set
Top 2%
3.6%
50% of probability mass above
7
Nucleic Acids Research
1128 papers in training set
Top 6%
3.6%
8
Nature Machine Intelligence
61 papers in training set
Top 1%
3.0%
9
Nano Letters
63 papers in training set
Top 1.0%
3.0%
10
Genome Biology
555 papers in training set
Top 3%
2.9%
11
Bioinformatics
1061 papers in training set
Top 6%
2.6%
12
Briefings in Bioinformatics
326 papers in training set
Top 3%
2.4%
13
Cell Reports Methods
141 papers in training set
Top 2%
2.1%
14
Genome Research
409 papers in training set
Top 2%
1.9%
15
Cell Systems
167 papers in training set
Top 6%
1.9%
16
Small Methods
26 papers in training set
Top 0.3%
1.9%
17
PLOS ONE
4510 papers in training set
Top 54%
1.7%
18
Cell
370 papers in training set
Top 12%
1.7%
19
Computational and Structural Biotechnology Journal
216 papers in training set
Top 5%
1.7%
20
iScience
1063 papers in training set
Top 15%
1.7%
21
Scientific Reports
3102 papers in training set
Top 64%
1.3%
22
Nature Biomedical Engineering
42 papers in training set
Top 1%
1.3%
23
Science Advances
1098 papers in training set
Top 23%
1.2%
24
Communications Biology
886 papers in training set
Top 22%
0.8%
25
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.7%