Back

DCAFA: Differential Community Abundance and Feature Analysis for Histological Images

Wright, G.; Keller, P.; Muter, J.; Brosens, J.; Tejpar, S.; Minhas, F.

2026-04-30 bioinformatics
10.64898/2026.04.28.721329 bioRxiv
Show abstract

Histological images are composed of diverse local structures such as cells, glands, or tissue patches, whose organisation and relative abundance reflect underlying biological processes. While most computational approaches focus on analysing individual features, many clinically relevant signals arise from changes in the composition of these structures rather than isolated measurements. This motivates the need for explicitly modelling how groups of similar instances vary across samples and relate to outcomes. We present DCAFA (Differential Community Abundance and Feature Attribution Analysis), a regression-based framework for analysing hierarchical biomedical data through both compositional and feature-level perspectives. DCAFA groups instances into latent communities representing recurring morphological or phenotypic patterns, and then performs two complementary analyses: (i) community composition analysis, which identifies groups that are enriched or depleted across outcomes, and (ii) feature attribution analysis, which quantifies how instance-level features relate to outcomes directly or within specific communities. Both use generalised linear and mixed-effects models, enabling covariate adjustment and inference through effect sizes, confidence intervals, and false discovery rate control. We demonstrate the utility of DCAFA across multiple biomedical settings, including endometrial histopathology, spatial transcriptomics, multiplex immunofluorescence imaging, and predefined cell-type analyses in colorectal cancer. These examples identify interpretable compositional shifts and context-specific feature associations that are not captured by conventional feature-based approaches. By unifying differential abundance testing and feature attribution within a single statistical framework, DCAFA serves as an openly available toolbox that provides a practical and interpretable means of linking tissue composition with clinical or molecular outcomes in biomedical imaging data. Code available at: https://github.com/wgrgwrght/DCAFA

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 1%
22.2%
2
PLOS Computational Biology
1633 papers in training set
Top 2%
14.2%
3
Cell Systems
167 papers in training set
Top 3%
4.3%
4
Genome Biology
555 papers in training set
Top 2%
4.3%
5
Briefings in Bioinformatics
326 papers in training set
Top 2%
3.5%
6
Nature Communications
4913 papers in training set
Top 41%
3.5%
50% of probability mass above
7
Patterns
70 papers in training set
Top 0.3%
3.0%
8
Genome Medicine
154 papers in training set
Top 3%
3.0%
9
Nature Methods
336 papers in training set
Top 3%
3.0%
10
BMC Bioinformatics
383 papers in training set
Top 3%
2.6%
11
Nature Biotechnology
147 papers in training set
Top 4%
2.1%
12
Scientific Reports
3102 papers in training set
Top 53%
1.9%
13
Bioinformatics Advances
184 papers in training set
Top 3%
1.8%
14
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 31%
1.8%
15
PLOS ONE
4510 papers in training set
Top 52%
1.8%
16
Advanced Science
249 papers in training set
Top 14%
1.3%
17
Biological Imaging
15 papers in training set
Top 0.1%
1.2%
18
Cell Reports Methods
141 papers in training set
Top 4%
1.1%
19
NAR Genomics and Bioinformatics
214 papers in training set
Top 3%
0.9%
20
Nucleic Acids Research
1128 papers in training set
Top 15%
0.9%
21
iScience
1063 papers in training set
Top 25%
0.9%
22
Communications Biology
886 papers in training set
Top 22%
0.8%
23
npj Systems Biology and Applications
99 papers in training set
Top 2%
0.8%
24
Frontiers in Bioinformatics
45 papers in training set
Top 0.8%
0.8%
25
Genome Research
409 papers in training set
Top 4%
0.7%
26
eLife
5422 papers in training set
Top 60%
0.7%
27
GigaScience
172 papers in training set
Top 3%
0.7%