Back

BEGA-UNet: Boundary-Explicit Guided Attention U-Net with Multi-Scale Feature Aggregation for Colonoscopic Polyp Segmentation

Tong, T.; Zhang, W.; Zu, W.

2026-03-05 gastroenterology
10.64898/2026.03.04.26347608 medRxiv
Show abstract

Accurate polyp segmentation from colonoscopy images is critical for colorectal cancer prevention, yet the generalization of deep learning models under domain shift remains insufficiently explored. We propose Boundary-Explicit Guided Attention U-Net (BEGA-UNet), a boundary-aware segmentation architecture that introduces explicit edge modeling as a structural inductive bias to enhance both segmentation accuracy and cross-domain robustness. The framework integrates three components: an Edge-Guided Module (EGM) with learnable Sobel-initialized operators to capture boundary cues, a Dual-Path Attention (DPA) module that processes channel and spatial attention in parallel, and a Multi-Scale Feature Aggregation (MSFA) module to encode contextual information across multiple receptive fields. Evaluated on the combined Kvasir-SEG and CVC-ClinicDB benchmarks, BEGA-UNet achieves 88.53% Dice and 82.51% IoU, outperforming representative convolutional and transformer-based baselines. More importantly, cross-dataset evaluation demonstrates strong robustness under domain shift, with BEGA-UNet retaining 83.2% of its in-distribution performance--substantially higher than U-Net (64.5%), Attention U-Net (47.5%), and TransUNet (53.1%). In a zero-shot setting on an entirely unseen dataset, the model further maintains 72.6% performance retention. Comprehensive ablation studies indicate that explicit boundary modeling plays a central role in improving generalization, while multi-scale context aggregation further stabilizes performance across domains. Feature distribution analyses support this observation by showing that edge-oriented representations exhibit markedly reduced cross-domain variability compared to appearance-driven features. Overall, BEGA-UNet provides an effective and interpretable solution for robust polyp segmentation, demonstrating that explicit boundary modeling serves as a critical inductive bias for ensuring reliability under clinical domain shifts.

Matching journals

The top 2 journals account for 50% of the predicted probability mass.

1
Nature Biomedical Engineering
42 papers in training set
Top 0.1%
43.3%
2
Nature Communications
4913 papers in training set
Top 16%
11.0%
50% of probability mass above
3
Medical Image Analysis
33 papers in training set
Top 0.2%
5.3%
4
Scientific Reports
3102 papers in training set
Top 31%
3.9%
5
Nature Machine Intelligence
61 papers in training set
Top 1%
3.1%
6
Cell Reports Medicine
140 papers in training set
Top 2%
2.3%
7
npj Digital Medicine
97 papers in training set
Top 2%
2.3%
8
Nature Medicine
117 papers in training set
Top 2%
1.9%
9
Med
38 papers in training set
Top 0.2%
1.9%
10
PLOS ONE
4510 papers in training set
Top 55%
1.6%
11
eBioMedicine
130 papers in training set
Top 2%
1.3%
12
IEEE Transactions on Medical Imaging
18 papers in training set
Top 0.3%
1.3%
13
Science Translational Medicine
111 papers in training set
Top 4%
1.0%
14
Communications Medicine
85 papers in training set
Top 0.6%
1.0%
15
iScience
1063 papers in training set
Top 28%
0.9%
16
Cancer Immunology, Immunotherapy
11 papers in training set
Top 0.3%
0.8%
17
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 2%
0.7%
18
npj Precision Oncology
48 papers in training set
Top 2%
0.5%
19
BMC Medicine
163 papers in training set
Top 8%
0.5%
20
Computer Methods and Programs in Biomedicine
27 papers in training set
Top 1%
0.5%
21
Frontiers in Medicine
113 papers in training set
Top 8%
0.5%
22
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 7%
0.5%
23
Advanced Science
249 papers in training set
Top 23%
0.5%
24
Nature Methods
336 papers in training set
Top 7%
0.5%
25
Journal of Biomedical Informatics
45 papers in training set
Top 2%
0.5%