Back

Multi-Scale Tri-Modal Histology Dataset Integrating Tumor Morphology, Immune Patterns, and Clinical Outcomes

Jung, K. J.; Qiu, J.; Cho, S.; McDonough, E.; Chadwick, C.; Ghose, S.; West, R. B.; Brooks, J. D.; Ginty, F.; Machiraju, R.; Mallick, P.

2026-05-19 bioinformatics
10.64898/2026.05.15.725535 bioRxiv
Show abstract

Accurate prognostic assessment of prostate cancer (PCa) requires an integrated understanding of tissue morphology-encompassing cell structure, glandular architecture, and tissue organization-and the immune environment. We present Prostate-TriMod, a novel tri-modal histology dataset designed to integrate high-resolution visual morphology with spatial tissue maps, immune infiltration patterns, and clinical outcomes. This dataset, generated from the Cell DIVE multiplexed imaging platform, consists of three synchronized modalities: (1) multiscale virtual H&E tiles (224px, 256px, 512px, and 2040px) providing visual morphological context, (2) spatial tissue maps identifying cancerous/non-cancerous epithelial cells, stroma and immune cell populations (via TOPAZ and CAT models), and (3) text captions generated from single-cell data and patterns. The dataset includes comprehensive clinical annotations, including Grade Groups and biochemical recurrence (BCR) status. By providing high-fidelity alignment between visual features, spatial tissue maps, and textual descriptions, Prostate-TriMod empowers the development of advanced multimodal AI frameworks. We expect this resource to support reuse in multimodal representation learning, spatial analysis, and benchmarking studies that link histology morphology and immune context to clinical outcomes in prostate cancer.

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 25%
7.1%
2
GigaScience
172 papers in training set
Top 0.2%
6.3%
3
iScience
1063 papers in training set
Top 1%
6.3%
4
Bioinformatics
1061 papers in training set
Top 4%
6.3%
5
Scientific Data
174 papers in training set
Top 0.3%
6.3%
6
Advanced Science
249 papers in training set
Top 4%
4.8%
7
Scientific Reports
3102 papers in training set
Top 24%
4.8%
8
PLOS Computational Biology
1633 papers in training set
Top 8%
4.1%
9
Patterns
70 papers in training set
Top 0.2%
3.9%
50% of probability mass above
10
Genome Medicine
154 papers in training set
Top 2%
3.6%
11
Nature Methods
336 papers in training set
Top 3%
3.2%
12
npj Digital Medicine
97 papers in training set
Top 2%
2.6%
13
Nucleic Acids Research
1128 papers in training set
Top 8%
2.3%
14
PLOS ONE
4510 papers in training set
Top 48%
2.1%
15
Computational and Structural Biotechnology Journal
216 papers in training set
Top 4%
1.9%
16
Cancer Research
116 papers in training set
Top 2%
1.9%
17
Briefings in Bioinformatics
326 papers in training set
Top 4%
1.8%
18
npj Systems Biology and Applications
99 papers in training set
Top 1%
1.3%
19
Frontiers in Bioinformatics
45 papers in training set
Top 0.4%
1.2%
20
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 1%
1.2%
21
Cell Reports Methods
141 papers in training set
Top 4%
0.9%
22
Cell Systems
167 papers in training set
Top 11%
0.9%
23
Disease Models & Mechanisms
119 papers in training set
Top 2%
0.8%
24
eLife
5422 papers in training set
Top 56%
0.8%
25
Journal of Translational Medicine
46 papers in training set
Top 2%
0.8%
26
Bioinformatics Advances
184 papers in training set
Top 5%
0.7%
27
Genome Biology
555 papers in training set
Top 8%
0.7%
28
BMC Medical Informatics and Decision Making
39 papers in training set
Top 3%
0.7%
29
Journal of Pathology Informatics
13 papers in training set
Top 0.4%
0.7%
30
Biology Methods and Protocols
53 papers in training set
Top 3%
0.7%