Back

The De Bruijn Mapping Problem with Changes in the Graph

B. Rocha, L.; Adi, S. S.; Araujo, E.

2024-02-15 bioinformatics
10.1101/2024.02.15.580401 bioRxiv
Show abstract

In computational biology, mapping a sequence s onto a sequence graph G poses a significant challenge. One possible approach to tackling this problem is to find a walk p in G that spells a sequence most similar to s. This challenge is formally known as the Graph Sequence Mapping Problem (GSMP). In this paper, we delve into an alternative problem formulation known as the De Bruijn Graph Sequence Mapping Problem (BSMP). Both problems have three variants: changes only in the sequence, changes in the graph, and changes in both the sequence and the graph. We concentrate on addressing the variant involving changes in the graph. In the literature, when this problem does not allow the De Bruijn graph to induce new arcs after changes, it becomes NP-complete, as proven by Gibney et. al [4]. However, we reformulate the problem by considering the characteristics of the arcs induced in the De Bruijn graph. This reformulation alters the problem definition, thereby enabling the application of a polynomial-time algorithm for its resolution. Approaching the problem with this arc-inducing characteristic is new, and the algorithm proposed in this work is new in the literature.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Journal of Computational Biology
37 papers in training set
Top 0.1%
17.3%
2
Bioinformatics
1061 papers in training set
Top 2%
17.3%
3
BMC Bioinformatics
383 papers in training set
Top 0.9%
10.0%
4
Algorithms for Molecular Biology
15 papers in training set
Top 0.1%
9.0%
50% of probability mass above
5
Scientific Reports
3102 papers in training set
Top 32%
3.9%
6
PLOS Computational Biology
1633 papers in training set
Top 9%
3.9%
7
IEEE/ACM Transactions on Computational Biology and Bioinformatics
32 papers in training set
Top 0.1%
3.6%
8
PLOS ONE
4510 papers in training set
Top 40%
3.5%
9
Frontiers in Molecular Biosciences
100 papers in training set
Top 0.8%
2.6%
10
Bioinformatics Advances
184 papers in training set
Top 3%
1.7%
11
Briefings in Bioinformatics
326 papers in training set
Top 4%
1.6%
12
Genome Research
409 papers in training set
Top 2%
1.6%
13
Frontiers in Genetics
197 papers in training set
Top 6%
1.3%
14
Journal of Bioinformatics and Systems Biology
14 papers in training set
Top 0.4%
0.9%
15
iScience
1063 papers in training set
Top 25%
0.9%
16
Peer Community Journal
254 papers in training set
Top 3%
0.8%
17
Biomolecules
95 papers in training set
Top 2%
0.8%
18
Journal of Molecular Biology
217 papers in training set
Top 4%
0.7%
19
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.7%
20
Genes
126 papers in training set
Top 3%
0.7%
21
Cell Systems
167 papers in training set
Top 14%
0.6%