Back

CellWalker2: multi-omic discovery of hierarchical cell type relationships and their associations with genomic annotations

Hu, Z.; Przytycki, P. F.; Pollard, K. S.

2024-05-17 bioinformatics
10.1101/2024.05.17.594770 bioRxiv
Show abstract

CellWalker2 is a graph diffusion-based method for single-cell genomics data integration. It extends the CellWalker model by incorporating hierarchical relationships between cell types, providing estimates of statistical significance, and adding data structures for analyzing multi-omics data so that gene expression and open chromatin can be jointly modeled. Our open-source software enables users to annotate cells using existing ontologies and to probabilistically match cell types between two or more contexts, including across species. CellWalker2 can also map genomic regions to cell ontologies, enabling precise annotation of elements derived from bulk data, such as enhancers, genetic variants, and sequence motifs. Through simulation studies, we show that CellWalker2 performs better than existing methods in cell type annotation and mapping. We then use data from the brain and immune system to demonstrate CellWalker2s ability to discover cell type-specific regulatory programs and both conserved and divergent cell type relationships in complex tissues.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Genome Biology
555 papers in training set
Top 0.1%
22.2%
2
Bioinformatics
1061 papers in training set
Top 3%
10.0%
3
Cell Systems
167 papers in training set
Top 1%
8.3%
4
Nature Biotechnology
147 papers in training set
Top 0.9%
8.3%
5
Genome Research
409 papers in training set
Top 0.4%
6.2%
50% of probability mass above
6
Nucleic Acids Research
1128 papers in training set
Top 4%
4.8%
7
Nature Methods
336 papers in training set
Top 2%
4.8%
8
Genome Medicine
154 papers in training set
Top 2%
4.1%
9
Nature Genetics
240 papers in training set
Top 2%
3.6%
10
Nature Communications
4913 papers in training set
Top 43%
3.0%
11
PLOS Computational Biology
1633 papers in training set
Top 12%
2.7%
12
BMC Bioinformatics
383 papers in training set
Top 4%
2.3%
13
PLOS ONE
4510 papers in training set
Top 51%
1.9%
14
NAR Genomics and Bioinformatics
214 papers in training set
Top 2%
1.5%
15
Molecular Systems Biology
142 papers in training set
Top 1.0%
1.2%
16
Briefings in Bioinformatics
326 papers in training set
Top 5%
0.9%
17
Frontiers in Genetics
197 papers in training set
Top 8%
0.9%
18
The American Journal of Human Genetics
206 papers in training set
Top 3%
0.9%
19
BMC Genomics
328 papers in training set
Top 5%
0.9%
20
PLOS Genetics
756 papers in training set
Top 13%
0.9%
21
Cell Reports Methods
141 papers in training set
Top 5%
0.7%
22
Nature
575 papers in training set
Top 16%
0.7%
23
iScience
1063 papers in training set
Top 35%
0.7%
24
Science
429 papers in training set
Top 21%
0.6%