Back

CCIDeconv: Hierarchical model for deconvolution of subcellular cell-cell interactions in single-cell data

Jayakumar, R.; Panwar, P.; Yang, J. Y. H.; Ghazanfar, S.

2026-03-30 bioinformatics
10.64898/2026.03.26.714643 bioRxiv
Show abstract

MotivationCell-cell interaction (CCI) underlies several fundamental mechanisms including development, homeostasis and disease progression. CCI are known to be localised to specific subcellular regions, for example, within the cytoplasms of cells. With the emergence of subcellular spatial transcriptomics technologies (sST), there is an opportunity to attribute CCI to subcellular regions. We aimed to deconvolute CCI to subcellular CCI (sCCI) in non-spatial single cell transcriptomics data (i.e. scRNA-seq) datasets using a modified CCI score from CellChat. ResultsBy calculating the sCCI score specific to cytoplasm and nucleus in nine publicly available sST datasets, we identified unique nucleus-nucleus and cytoplasm-cytoplasm sCCI. Then, we deconvolved the communication score to subcellular regions by using a hierarchical classification and regression model which we name as CCIDeconv. We performed leave-one-dataset-out cross-validation across nine datasets over a range of different tissue types from human samples. We observed that training across many different tissue types resulted in robust deconvolution performance in an unseen dataset. As the number of training datasets increased, models trained without spatial features achieved similar performance as models including spatial features. This implied the potential for accurate prediction of sCCI events from even scRNA-seq with large numbers of training datasets. Overall, we offer a method towards attributing CCI events to subcellular regions. This method can allow researchers in dissecting sCCI patterns to gain insights in underlying biology in a range of tissues covering health and disease.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 2%
18.4%
2
BMC Bioinformatics
383 papers in training set
Top 0.3%
18.3%
3
Frontiers in Genetics
197 papers in training set
Top 1.0%
6.2%
4
Genome Biology
555 papers in training set
Top 2%
3.9%
5
Bioinformatics Advances
184 papers in training set
Top 1%
3.9%
50% of probability mass above
6
Nature Communications
4913 papers in training set
Top 41%
3.5%
7
Scientific Reports
3102 papers in training set
Top 38%
3.5%
8
iScience
1063 papers in training set
Top 7%
2.8%
9
Briefings in Bioinformatics
326 papers in training set
Top 3%
2.7%
10
PLOS ONE
4510 papers in training set
Top 47%
2.3%
11
NAR Genomics and Bioinformatics
214 papers in training set
Top 1%
2.1%
12
PLOS Computational Biology
1633 papers in training set
Top 14%
2.0%
13
GigaScience
172 papers in training set
Top 1%
1.9%
14
Computational and Structural Biotechnology Journal
216 papers in training set
Top 5%
1.5%
15
Genome Medicine
154 papers in training set
Top 5%
1.5%
16
Database
51 papers in training set
Top 0.5%
1.3%
17
Communications Biology
886 papers in training set
Top 15%
1.2%
18
Nucleic Acids Research
1128 papers in training set
Top 14%
1.2%
19
Heliyon
146 papers in training set
Top 4%
0.9%
20
BioData Mining
15 papers in training set
Top 0.6%
0.9%
21
Life Science Alliance
263 papers in training set
Top 1%
0.9%
22
npj Systems Biology and Applications
99 papers in training set
Top 2%
0.8%
23
eneuro
389 papers in training set
Top 9%
0.8%
24
Molecular Systems Biology
142 papers in training set
Top 2%
0.7%
25
Advanced Science
249 papers in training set
Top 20%
0.7%
26
PeerJ
261 papers in training set
Top 18%
0.6%
27
Patterns
70 papers in training set
Top 3%
0.6%
28
Genomics
60 papers in training set
Top 3%
0.6%
29
Frontiers in Bioinformatics
45 papers in training set
Top 1%
0.6%