Back

cellNexus: Quality control, annotation, aggregation and analytical layers for the Human Cell Atlas data

Shen, M.; Gao, Y.; Liu, N.; Bhuva, D.; Milton, M.; Henao, J.; Andrews, J.; Yang, E.; Zhan, C.; Liu, N.; Si, S.; Hutchison, W. J.; Shakeel, M. H.; Morgan, M.; Papenfuss, A. T.; Iskander, J.; Polo, J. M.; Mangiola, S.

2026-04-17 bioinformatics
10.64898/2026.04.14.718336 bioRxiv
Show abstract

Large-scale single-cell atlases such as the Human Cell Atlas have transformed our understanding of human biology. Yet, the lack of a robust framework that standardises quality control, expands cellular annotation, and adds normalisation and analytical layers, limits multi-study analyses and the usefulness of this resource. Here we present cellNexus, a comprehensive tool and resource that converts the Human Cell Atlas collection into analysis-ready data by linking quality control layers, metadata enrichment, expression normalisation, analysis and data aggregation. These enhancements enable robust statistical modelling across studies, exemplified by a multi-tissue map of immune cell communication during ageing, which reveals macrophage-muscle axes as among the most depleted regenerative interactions with age. All harmonised layers, including pseudobulk and cell-cell communication summaries, are accessible via a public web interface and with R and Python APIs. By providing continuous integration with CELLxGENE releases, cellNexus transforms large cell atlas corpora into an accessible, reproducible, interoperable foundation for large-scale biological discovery and the next generation of single-cell foundation models.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 4%
21.9%
2
Nucleic Acids Research
1128 papers in training set
Top 2%
8.2%
3
Nature
575 papers in training set
Top 5%
6.1%
4
Genome Medicine
154 papers in training set
Top 2%
4.2%
5
Genome Biology
555 papers in training set
Top 2%
3.6%
6
Nature Biotechnology
147 papers in training set
Top 3%
3.5%
7
Aging Cell
144 papers in training set
Top 1%
3.5%
50% of probability mass above
8
Science
429 papers in training set
Top 9%
3.5%
9
Nature Aging
51 papers in training set
Top 0.6%
3.0%
10
Nature Cell Biology
99 papers in training set
Top 2%
2.4%
11
Nature Methods
336 papers in training set
Top 4%
2.0%
12
Cell Genomics
162 papers in training set
Top 3%
2.0%
13
Nature Genetics
240 papers in training set
Top 4%
1.8%
14
Molecular Systems Biology
142 papers in training set
Top 0.5%
1.8%
15
Scientific Data
174 papers in training set
Top 1%
1.6%
16
Advanced Science
249 papers in training set
Top 12%
1.6%
17
Life Science Alliance
263 papers in training set
Top 0.4%
1.6%
18
Bioinformatics
1061 papers in training set
Top 7%
1.6%
19
Briefings in Bioinformatics
326 papers in training set
Top 4%
1.6%
20
Communications Biology
886 papers in training set
Top 10%
1.6%
21
Cell Reports
1338 papers in training set
Top 27%
1.3%
22
Nature Medicine
117 papers in training set
Top 3%
1.3%
23
Cell Systems
167 papers in training set
Top 9%
1.2%
24
PLOS Computational Biology
1633 papers in training set
Top 23%
0.9%
25
GigaScience
172 papers in training set
Top 3%
0.7%
26
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.7%
27
eLife
5422 papers in training set
Top 60%
0.7%
28
Scientific Reports
3102 papers in training set
Top 77%
0.7%
29
Development
440 papers in training set
Top 4%
0.7%
30
Cell
370 papers in training set
Top 18%
0.7%