Back

HistoSB-Net: Semantic Bridging for Data-Limited Cross-Modal Histopathological Diagnosis

Bai, B.; Shih, T.-C.; Miyata, K.

2026-03-26 pathology
10.64898/2026.03.23.713838 bioRxiv
Show abstract

Vision-language models (VLMs) provide a unified framework for multimodal reasoning, yet their representations are primarily learned from natural image-text corpora and often exhibit semantic misalignment when transferred to histopathology, particularly under data-limited diagnostic settings. To address this limitation, we propose HistoSB-Net, a semantic bridging network designed to adapt pre-trained VLMs to multimodal histopathological diagnosis while preserving their original semantic structure. HistoSB-Net introduces a constrained semantic bridging (CSB) module that operates within the self-attention projection space of both vision and text encoders. Instead of employing explicit cross-attention or full fine-tuning, CSB adaptively modulates pre-trained attention projections through a lightweight nonlinear semantic bottleneck, enabling structured cross-modal regulation with limited additional parameters. The framework supports both patch-level and whole-slide image (WSI)-level diagnosis within a unified architecture. Experiments on six pathology benchmarks, comprising two WSI-level and four patch-level datasets, demonstrate consistent improvements over zero-shot inference across 36 backbone-dataset combinations under limited supervision. Further analysis of prototype-based margin distributions and confusion matrices shows that these improvements are accompanied by enhanced intra-class compactness and increased inter-class separation in the embedding space. These results indicate that CSB provides an effective and computationally manageable strategy for adapting pre-trained VLMs to data-limited digital pathology tasks.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Modern Pathology
21 papers in training set
Top 0.1%
18.9%
2
Medical Image Analysis
33 papers in training set
Top 0.1%
14.9%
3
Journal of Pathology Informatics
13 papers in training set
Top 0.1%
14.5%
4
Nature Communications
4913 papers in training set
Top 24%
7.3%
50% of probability mass above
5
npj Digital Medicine
97 papers in training set
Top 1%
3.6%
6
Nature Methods
336 papers in training set
Top 3%
2.5%
7
Scientific Reports
3102 papers in training set
Top 50%
2.1%
8
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 0.8%
1.9%
9
Advanced Science
249 papers in training set
Top 12%
1.5%
10
eBioMedicine
130 papers in training set
Top 2%
1.5%
11
PLOS ONE
4510 papers in training set
Top 56%
1.5%
12
Nature Machine Intelligence
61 papers in training set
Top 2%
1.3%
13
iScience
1063 papers in training set
Top 21%
1.2%
14
Communications Biology
886 papers in training set
Top 14%
1.2%
15
Cancer Research
116 papers in training set
Top 3%
1.2%
16
IEEE Transactions on Medical Imaging
18 papers in training set
Top 0.4%
1.0%
17
Science Translational Medicine
111 papers in training set
Top 5%
0.9%
18
ACS Nano
99 papers in training set
Top 4%
0.8%
19
Clinical Cancer Research
58 papers in training set
Top 2%
0.8%
20
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 42%
0.8%
21
Cell Reports Medicine
140 papers in training set
Top 7%
0.8%
22
Nature Medicine
117 papers in training set
Top 5%
0.8%
23
The Lancet Digital Health
25 papers in training set
Top 1%
0.8%
24
The Lancet Infectious Diseases
71 papers in training set
Top 3%
0.7%
25
Briefings in Bioinformatics
326 papers in training set
Top 7%
0.7%
26
Light: Science & Applications
16 papers in training set
Top 0.7%
0.7%
27
Journal of Biomedical Informatics
45 papers in training set
Top 2%
0.7%
28
Genome Biology
555 papers in training set
Top 8%
0.7%
29
Frontiers in Bioinformatics
45 papers in training set
Top 1%
0.7%
30
BMC Medical Informatics and Decision Making
39 papers in training set
Top 3%
0.7%