Back

A harmonized single-cell RNA-seq atlas of human localized and metastatic prostate cancers and benign tissues

Cho, H.; Zhang, Y.; Zhou, J.; Daggar, A.; Kang, S.; Mannan, R.; Cao, X.; Dhanasekaran, S. M.; Chinnaiyan, A. M.

2026-05-20 cancer biology
10.64898/2026.05.18.725966 bioRxiv
Show abstract

Single-cell RNA sequencing (scRNA-seq) effectively captures the differences in transcriptomic landscape of cell types and cell states between benign and cancer tissues. Pooling publicly available datasets distributed across independent studies enables increased sample representation and cross-study comparisons. Here we present a harmonized scRNA-seq atlas of the human prostate constructed by integrating 17 available studies, comprising 163 samples from 106 donors. The dataset contains benign tissue, primary tumors, and metastatic disease profiles. Raw sequencing FASTQ data files were uniformly reprocessed to minimize technical variability. Study metadata were curated and standardized using a unified schema capturing donor identity, tissue site, disease context, and histologic grade. Post quality control, the integrated dataset contains 754,000 high-quality cells. Harmonized cell type annotations were generated using a pseudobulk correlation framework informed by multiple reference resources. The workflow identified 17 distinct cell types representing epithelial, mesenchymal, and immune compartments of the prostate. The processed expression matrices, standardized metadata, and analysis workflows are publicly available to support reproducible analysis and enable exploration of heterogeneity across prostate disease states.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 2%
23.5%
2
Genome Medicine
154 papers in training set
Top 0.1%
23.5%
3
Cell Reports
1338 papers in training set
Top 6%
6.6%
50% of probability mass above
4
Scientific Reports
3102 papers in training set
Top 30%
4.0%
5
JCI Insight
241 papers in training set
Top 2%
2.7%
6
Cell Reports Medicine
140 papers in training set
Top 2%
2.5%
7
Journal of Translational Medicine
46 papers in training set
Top 0.5%
2.2%
8
Communications Biology
886 papers in training set
Top 5%
2.2%
9
Advanced Science
249 papers in training set
Top 10%
1.8%
10
Scientific Data
174 papers in training set
Top 1%
1.6%
11
PLOS ONE
4510 papers in training set
Top 57%
1.4%
12
Nature Genetics
240 papers in training set
Top 5%
1.4%
13
Nucleic Acids Research
1128 papers in training set
Top 13%
1.3%
14
Cell Genomics
162 papers in training set
Top 5%
1.2%
15
Cancer Research
116 papers in training set
Top 3%
1.2%
16
Modern Pathology
21 papers in training set
Top 0.3%
1.0%
17
Genome Biology
555 papers in training set
Top 6%
0.9%
18
Journal of Clinical Investigation
164 papers in training set
Top 5%
0.9%
19
eLife
5422 papers in training set
Top 52%
0.9%
20
Science Translational Medicine
111 papers in training set
Top 6%
0.8%
21
Cancer Research Communications
46 papers in training set
Top 1%
0.8%
22
Clinical Cancer Research
58 papers in training set
Top 2%
0.8%
23
The American Journal of Human Genetics
206 papers in training set
Top 4%
0.8%
24
Cancer Discovery
61 papers in training set
Top 2%
0.7%
25
Science Advances
1098 papers in training set
Top 33%
0.7%
26
iScience
1063 papers in training set
Top 39%
0.5%
27
The Prostate
11 papers in training set
Top 0.2%
0.5%
28
Cell Reports Methods
141 papers in training set
Top 7%
0.5%
29
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 49%
0.5%