Back

SegBio: A lightweight end-to-end toolkit for Instance Segmentation of biological samples

Bokman, E.; Barlam, N.; Babay, O.; Balshayi, Y.; Eliezer, Y.; Zaslaver, A.

2026-04-06 bioinformatics
10.64898/2026.04.03.716031 bioRxiv
Show abstract

High-throughput phenotyping of biological samples is essential for large-scale studies but is frequently bottlenecked by the need for accurate instance segmentation in crowded images. While deep learning offers powerful solutions, the high cost of manual annotation and the requirement for coding expertise often limit adoption in routine laboratory workflows. Here we present SegBio, a lightweight, open-source pipeline that enables end-to-end instance segmentation for non-expert users. The protocol features an interactive annotation GUI that extrapolates full masks from minimal centerline markings, significantly reducing manual labeling effort. It further integrates a configurable U-Net training module and a standalone inference application with a human-in-the-loop editing workflow for rapid and intuitive error correction. We employ the pipeline to annotate and train the model on a novel dataset of crowded C. elegans images. Validated on independent datasets, SegBio achieves high segmentation performance (Panoptic quality [~]0.85) and accurately quantifies per-animal morphology and fluorescence. By eliminating external dependencies and streamlining the correction process, SegBio provides a scalable solution for routine phenotyping that is easily generalized to other crowded biological samples, such as cellular organelles, cells, and organisms.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Nature Methods
336 papers in training set
Top 0.3%
21.9%
2
Nature Communications
4913 papers in training set
Top 18%
10.1%
3
Briefings in Bioinformatics
326 papers in training set
Top 0.6%
8.0%
4
Bioinformatics
1061 papers in training set
Top 4%
6.2%
5
Genome Medicine
154 papers in training set
Top 2%
4.2%
50% of probability mass above
6
Nature Machine Intelligence
61 papers in training set
Top 0.7%
4.2%
7
Genome Biology
555 papers in training set
Top 3%
3.5%
8
Nucleic Acids Research
1128 papers in training set
Top 6%
3.5%
9
BMC Bioinformatics
383 papers in training set
Top 3%
3.5%
10
PLOS Computational Biology
1633 papers in training set
Top 12%
2.8%
11
Communications Biology
886 papers in training set
Top 6%
2.0%
12
Cell Reports Methods
141 papers in training set
Top 2%
1.8%
13
Genome Research
409 papers in training set
Top 2%
1.7%
14
Scientific Reports
3102 papers in training set
Top 57%
1.7%
15
PLOS ONE
4510 papers in training set
Top 55%
1.6%
16
Nature Biotechnology
147 papers in training set
Top 5%
1.4%
17
GigaScience
172 papers in training set
Top 2%
1.4%
18
NAR Genomics and Bioinformatics
214 papers in training set
Top 3%
1.3%
19
Bioinformatics Advances
184 papers in training set
Top 3%
1.3%
20
iScience
1063 papers in training set
Top 23%
1.2%
21
Advanced Science
249 papers in training set
Top 17%
0.9%
22
Cell Systems
167 papers in training set
Top 11%
0.9%
23
Patterns
70 papers in training set
Top 3%
0.7%
24
eLife
5422 papers in training set
Top 62%
0.6%