A multi-flow approach for binning circular plasmids from short-reads assembly graphs

Epain, V.; Mane, A.; Della Vedova, G.; Bonizzoni, P.; Chauve, C.

2026-03-26 genomics
10.64898/2026.03.25.714305 bioRxiv

We address the problem of plasmid binning, which aims to group contigs from a draft short-read assembly of a bacterial sample into bins, each expected to correspond to a plasmid present in the sequenced bacterial genome. We formulate the plasmid binning problem as a network multi-flow problem in the assembly graph and describe a Mixed-Integer Linear Program to solve it. We compare our new method, PlasBin-HMF, with the state-of-the-art methods MOB-recon, gplasCC, and PlasBin-flow on a dataset of more than 500 bacterial samples, and show that PlasBin-HMF outperforms the other methods while preserving explainability.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1 | Genome Research | 409 papers in training set | Top 0.1% | 19.4%
2 | Bioinformatics | 1061 papers in training set | Top 3% | 10.4%
3 | iScience | 1063 papers in training set | Top 0.6% | 8.4%
4 | Nature Communications | 4913 papers in training set | Top 25% | 7.1%
5 | Genome Biology | 555 papers in training set | Top 1% | 6.3%
50% of probability mass above
6 | Nature Genetics | 240 papers in training set | Top 1% | 6.3%
7 | Nature Biotechnology | 147 papers in training set | Top 2% | 4.8%
8 | Nucleic Acids Research | 1128 papers in training set | Top 6% | 3.6%
9 | Bioinformatics Advances | 184 papers in training set | Top 2% | 2.7%
10 | BMC Bioinformatics | 383 papers in training set | Top 3% | 2.6%
11 | Nature Computational Science | 50 papers in training set | Top 0.5% | 1.9%
12 | PLOS Computational Biology | 1633 papers in training set | Top 16% | 1.7%
13 | Scientific Reports | 3102 papers in training set | Top 58% | 1.7%
14 | PLOS ONE | 4510 papers in training set | Top 55% | 1.7%
15 | Briefings in Bioinformatics | 326 papers in training set | Top 4% | 1.5%
16 | BMC Genomics | 328 papers in training set | Top 3% | 1.3%
17 | Frontiers in Genetics | 197 papers in training set | Top 6% | 1.3%
18 | Nature Methods | 336 papers in training set | Top 5% | 1.2%
19 | NAR Genomics and Bioinformatics | 214 papers in training set | Top 3% | 1.2%
20 | Communications Biology | 886 papers in training set | Top 14% | 1.2%
21 | IEEE Transactions on Computational Biology and Bioinformatics | 17 papers in training set | Top 0.4% | 0.9%
22 | Proceedings of the National Academy of Sciences | 2130 papers in training set | Top 41% | 0.9%
23 | Cell Systems | 167 papers in training set | Top 13% | 0.7%
24 | Genome Medicine | 154 papers in training set | Top 9% | 0.7%
25 | Microbial Genomics | 204 papers in training set | Top 2% | 0.6%