One Cell At a Time: A Unified Framework to Integrate and Analyze Single-cell RNA-seq Data

Wang, C. X.; Zhang, L.; Wang, B.

2021-07-16 genomics
10.1101/2021.05.12.443814 bioRxiv
The surge of single-cell RNA sequencing technologies has given rise to an abundance of large single-cell RNA-seq (scRNA-seq) datasets at the scale of hundreds of thousands of single cells. Integrative analysis of large-scale scRNA-seq datasets has the potential to reveal de novo cell types and to aggregate biological information. However, most existing methods fail to integrate multiple large-scale scRNA-seq datasets in a computationally and memory-efficient way. We hereby propose OCAT (One Cell At a Time), a graph-based method that sparsely encodes single-cell gene expression to integrate data from multiple sources without selecting the most variable genes or explicitly correcting batch effects. We demonstrate that OCAT efficiently integrates multiple scRNA-seq datasets and achieves state-of-the-art performance in cell type clustering, especially in challenging scenarios with non-overlapping cell types. In addition, OCAT facilitates a variety of downstream analyses, such as differential gene analysis, trajectory inference, pseudotime inference and cell inference. OCAT is a unifying tool that simplifies and expedites the analysis of large-scale scRNA-seq data from heterogeneous sources.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1. Genome Research; 409 papers in training set; Top 0.1%; 25.7%
2. Bioinformatics; 1061 papers in training set; Top 2%; 12.2%
3. Nature Methods; 336 papers in training set; Top 1%; 7.1%
4. Nature Computational Science; 50 papers in training set; Top 0.1%; 6.8%
50% of probability mass above
5. Nature Biotechnology; 147 papers in training set; Top 1%; 6.3%
6. Genome Biology; 555 papers in training set; Top 1%; 6.3%
7. Nature Communications; 4913 papers in training set; Top 35%; 4.3%
8. Genomics, Proteomics & Bioinformatics; 171 papers in training set; Top 2%; 3.6%
9. Nucleic Acids Research; 1128 papers in training set; Top 7%; 3.0%
10. Briefings in Bioinformatics; 326 papers in training set; Top 2%; 2.7%
11. Nature Genetics; 240 papers in training set; Top 3%; 2.6%
12. Proceedings of the National Academy of Sciences; 2130 papers in training set; Top 30%; 1.9%
13. IEEE Transactions on Computational Biology and Bioinformatics; 17 papers in training set; Top 0.4%; 0.9%
14. iScience; 1063 papers in training set; Top 25%; 0.9%
15. Genome Medicine; 154 papers in training set; Top 7%; 0.9%
16. Cell Reports Methods; 141 papers in training set; Top 4%; 0.9%
17. PLOS Computational Biology; 1633 papers in training set; Top 25%; 0.7%
18. Journal of Genetics and Genomics; 36 papers in training set; Top 2%; 0.7%
19. PLOS ONE; 4510 papers in training set; Top 72%; 0.6%
20. The American Journal of Human Genetics; 206 papers in training set; Top 4%; 0.6%
21. Bioinformatics Advances; 184 papers in training set; Top 5%; 0.6%
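
The statement that the top 4 journals account for 50% of the predicted probability mass can be checked directly from the percentages listed above. The following is a minimal sketch of that arithmetic, not the page's actual implementation; the function name `ranks_to_reach` and its `threshold` parameter are hypothetical, introduced only for illustration.

```python
from itertools import accumulate

# Predicted probabilities (%) for the 21 listed journals, in rank order,
# copied from the list above.
probs = [25.7, 12.2, 7.1, 6.8, 6.3, 6.3, 4.3, 3.6, 3.0, 2.7, 2.6,
         1.9, 0.9, 0.9, 0.9, 0.9, 0.7, 0.7, 0.6, 0.6, 0.6]


def ranks_to_reach(probabilities, threshold=50.0):
    """Return the smallest number of top-ranked entries whose running
    sum reaches `threshold`, or None if the threshold is never reached.

    Hypothetical helper for illustration only.
    """
    for k, total in enumerate(accumulate(probabilities), start=1):
        if total >= threshold:
            return k
    return None


top_k = ranks_to_reach(probs)
print(top_k, round(sum(probs[:top_k]), 1))
```

The first three entries sum to 45.0%, so the fourth (6.8%) is the one that carries the running total past 50%, to 51.8%.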