Back

UMITIC: An unsupervised framework for the joint characterization of cellular phenotypes and spatial neighborhoods in multiplex and hyperplex immunofluorescence imaging data

Sangüesa Recalde, M.; De Andrea, C. E.; Ariz, M.

2026-06-01 bioinformatics
10.64898/2026.05.29.728633 bioRxiv
Show abstract

Multiplexed imaging technologies enable the simultaneous measurement of dozens of protein markers while preserving context, providing a high-resolution view of tissue organization schemes. However, extracting meaningful insights from these high-dimensional datasets--particularly in hyperplex settings (>20 markers)--remains a major computational challenge, especially in the absence of annotated data. Here, we present UMITIC (Unsupervised Analysis of Multiplex Images via TIssue Characterization), a modular and unsupervised computational framework for the joint characterization of cell phenotypes and tissue neighborhoods from multiplex imaging data. UMITIC integrates three components: (i) CellCut, a strategy that combines nuclear and cytoplasmic predictions to improve the delineation capabilities of the framework; (ii) CellMap, a contrastive learning approach that generates low-dimensional representations of single-cell image crops that are enriched with morphological features; and (iii) TissueNet, a graph neural network that models spatial cell-cell interactions to identify tissue neighborhoods. We evaluated UMITIC across four datasets of increasing complexity to assess its robustness, scalability and biological relevance. With respect to a 7-plex human tonsil dataset, the framework identified canonical immune cell populations and reconstructed well-established anatomical regions. When applied to a 43-plex tonsil image, UMITIC preserved these tissue-level structures while enabling a finer cell subtype stratification process driven by increased marker dimensionality. We further validated our method on a 58-plex colorectal cancer cohort, where UMITIC was able to recover previously reported immune composition differences and spatial organization variations between patient groups with different prognoses. Finally, when an expert-annotated mass cytometry imaging dataset concerning human lung tissue was used, UMITIC achieved higher agreement with the reference tissue annotations than the existing approaches did, demonstrating improved lung microanatomy reconstruction accuracy. Together, these results show that UMITIC enables consistent and interpretable analyses of both cellular phenotypes and tissue architectures across diverse multiplex and hyperplex imaging datasets without the need for manual annotations. Author summaryUnderstanding how cells are organized within tissues is fundamental to deciphering diseases, yet analyzing tissue imaging data remains a major challenge. The recently developed imaging technologies enable the visualization of dozens of proteins in a single tissue section, revealing unprecedented cell identity and spatial organization details. However, extracting meaningful biological insights requires extensive manual annotation work performed by expert pathologists, limiting the scalability. Here, we present a fully automated computational framework that characterizes tissue architectures in an unsupervised manner at two complementary levels: it identifies cell types based on their protein expressions and morphologies and maps how those cells are organized into spatially coherent tissue structures, and it does so without requiring any manual annotations. Our approach is modular and interpretable at the cell level. We validated our framework across four independent datasets with panels consisting of 7 to 58 simultaneous protein markers, including healthy human tissue and a colorectal cancer cohort in which patients with distinct immune profiles were analyzed. Remarkably, UMITIC improved upon the performance of existing methods across both qualitative and quantitative assessments. These results suggest that our framework provides objective, interpretable and reproducible image processing tools for conducting tissue analyses in both research and clinical settings.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 2%
18.1%
2
Nature Communications
4913 papers in training set
Top 25%
7.1%
3
Nature Methods
336 papers in training set
Top 2%
6.3%
4
Cell Systems
167 papers in training set
Top 3%
4.8%
5
Cell Reports Methods
141 papers in training set
Top 0.5%
4.8%
6
Patterns
70 papers in training set
Top 0.1%
4.1%
7
Advanced Science
249 papers in training set
Top 5%
3.9%
8
Nature Biotechnology
147 papers in training set
Top 3%
3.0%
50% of probability mass above
9
Cytometry Part A
30 papers in training set
Top 0.1%
2.7%
10
Nucleic Acids Research
1128 papers in training set
Top 8%
2.6%
11
PLOS Computational Biology
1633 papers in training set
Top 13%
2.4%
12
Biological Imaging
15 papers in training set
Top 0.1%
2.4%
13
Communications Biology
886 papers in training set
Top 4%
2.4%
14
Briefings in Bioinformatics
326 papers in training set
Top 3%
2.3%
15
Genome Medicine
154 papers in training set
Top 4%
1.9%
16
iScience
1063 papers in training set
Top 12%
1.9%
17
Scientific Reports
3102 papers in training set
Top 55%
1.8%
18
Genome Biology
555 papers in training set
Top 5%
1.6%
19
GigaScience
172 papers in training set
Top 2%
1.6%
20
NAR Genomics and Bioinformatics
214 papers in training set
Top 2%
1.3%
21
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 37%
1.3%
22
PLOS ONE
4510 papers in training set
Top 60%
1.2%
23
Computational and Structural Biotechnology Journal
216 papers in training set
Top 6%
1.2%
24
Science Advances
1098 papers in training set
Top 29%
0.8%
25
BMC Bioinformatics
383 papers in training set
Top 7%
0.8%
26
npj Systems Biology and Applications
99 papers in training set
Top 2%
0.8%
27
Cell Genomics
162 papers in training set
Top 6%
0.8%
28
Journal of Cell Science
353 papers in training set
Top 2%
0.7%
29
Small Methods
26 papers in training set
Top 1%
0.7%
30
Frontiers in Immunology
586 papers in training set
Top 9%
0.6%