Back

GxP Single-cell RNA-seq and Spatial Transcriptomics end-to-end pipeline for clinical research

Zaratiegui, A.; Burfield, T.; Povlsen, H. R.; Sola, M. E. G.; Czaban, A.; Soh, K.; Das, V.

2026-01-26 bioinformatics
10.64898/2026.01.23.701261 bioRxiv
Show abstract

Single-cell/nucleus RNA-sequencing and Spatial Transcriptomics are powerful tools for investigating cellular heterogeneity and tissue architecture that have deepened our disease understanding. Their broader adoption in clinical and regulated settings, however, is hindered by challenges related to data integrity, regulatory compliance, reproducibility, and scalability. To address this gap, we developed NNclinSSOAP (Novo Nordisk Clinical Single-cell Spatial Omics Analytical Pipeline) - a modular, GxP-ready end-to-end computational pipeline, that combines established single-cell workflows with a new Nextflow pipeline for Spatial Transcriptomics. NNclinSSOAP transforms RNA sequencing and Xenium spatial data into integrated, annotated single-cell objects and spatially resolved tissue maps. Designed to support mechanistic studies and clinical endpoint generation, it enables traceable and reproducible processing of large-scale datasets, scalable for both local and HPC environments. Here, we provide a step-by-step guide for using NNclinSSOAP. All code and data are publicly available. Using a standard laptop, the pipeline can be executed within 1.5 hours.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Nature Biotechnology
147 papers in training set
Top 0.1%
23.5%
2
Nature Methods
336 papers in training set
Top 0.3%
19.5%
3
Genome Biology
555 papers in training set
Top 0.6%
8.6%
50% of probability mass above
4
Nature Communications
4913 papers in training set
Top 27%
6.7%
5
Nucleic Acids Research
1128 papers in training set
Top 4%
4.5%
6
Cell Systems
167 papers in training set
Top 3%
4.1%
7
Genome Medicine
154 papers in training set
Top 2%
3.8%
8
Bioinformatics
1061 papers in training set
Top 6%
2.9%
9
Nature Genetics
240 papers in training set
Top 3%
2.9%
10
Advanced Science
249 papers in training set
Top 8%
2.2%
11
Nature
575 papers in training set
Top 9%
2.2%
12
Genome Research
409 papers in training set
Top 2%
2.0%
13
Briefings in Bioinformatics
326 papers in training set
Top 5%
1.2%
14
Cell Reports Methods
141 papers in training set
Top 4%
1.0%
15
PLOS ONE
4510 papers in training set
Top 68%
0.8%
16
iScience
1063 papers in training set
Top 32%
0.8%
17
Nature Computational Science
50 papers in training set
Top 2%
0.7%
18
Bioinformatics Advances
184 papers in training set
Top 5%
0.5%
19
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 48%
0.5%
20
Cell Reports Medicine
140 papers in training set
Top 10%
0.5%
21
eLife
5422 papers in training set
Top 62%
0.5%
22
Nature Biomedical Engineering
42 papers in training set
Top 3%
0.5%
23
Communications Biology
886 papers in training set
Top 31%
0.5%
24
Cell
370 papers in training set
Top 19%
0.5%
25
Patterns
70 papers in training set
Top 3%
0.5%