Back

Jointly aligning cells and genomic features of single-cell multi-omics data with co-optimal transport

Demetci, P.; Tran, Q. H.; Redko, I.; Singh, R.

2022-12-11 bioinformatics
10.1101/2022.11.09.515883 bioRxiv
Show abstract

The availability of various single-cell sequencing technologies allows one to jointly study multiple genomic features and understand how they interact to regulate cells. Although there are experimental challenges to simultaneously profile multiple features on the same single cell, recent computational methods can align the cells from unpaired multi-omic datasets. However, studying regulation also requires us to map the genomic features across different measurements. Unfortunately, most single-cell multi-omic alignment tools cannot perform these alignments or need prior knowledge. We introduce O_SCPLOWSCOOTRC_SCPLOW, a co-optimal transport-based method, which jointly aligns both cells and genomic features of unpaired single-cell multi-omic datasets. We apply O_SCPLOWSCOOTRC_SCPLOW to various single-cell multi-omic datasets with different types of measurements. Our results show that O_SCPLOWSCOOTRC_SCPLOW provides quality alignments for unsupervised cell-level and feature-level integration of datasets with sparse feature correspondences (e.g., one-to-one mappings). For datasets with dense feature correspondences (e.g., many-to-many mappings), our joint framework allows us to provide supervision on one level (e.g., cell types), thus improving alignment performance on the other (e.g., genomic features) or vice-versa. The unique joint alignment framework makes O_SCPLOWSCOOTRC_SCPLOW a helpful hypothesis-generation tool for the integrative study of unpaired single-cell multi-omic datasets. Available at: https://github.com/rsinghlab/SCOOTR.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Genome Biology
555 papers in training set
Top 0.1%
26.1%
2
Cell Systems
167 papers in training set
Top 0.6%
14.5%
3
Genome Research
409 papers in training set
Top 0.1%
10.2%
50% of probability mass above
4
Bioinformatics
1061 papers in training set
Top 3%
8.5%
5
Nature Methods
336 papers in training set
Top 2%
6.4%
6
Nature Biotechnology
147 papers in training set
Top 2%
4.9%
7
Nature Communications
4913 papers in training set
Top 32%
4.9%
8
Bioinformatics Advances
184 papers in training set
Top 3%
1.7%
9
Nucleic Acids Research
1128 papers in training set
Top 10%
1.7%
10
iScience
1063 papers in training set
Top 19%
1.3%
11
Nature Genetics
240 papers in training set
Top 5%
1.3%
12
PLOS Computational Biology
1633 papers in training set
Top 19%
1.2%
13
Briefings in Bioinformatics
326 papers in training set
Top 5%
1.0%
14
BMC Bioinformatics
383 papers in training set
Top 6%
0.8%
15
Cell Reports Methods
141 papers in training set
Top 5%
0.8%
16
Genome Medicine
154 papers in training set
Top 7%
0.8%
17
Nature
575 papers in training set
Top 15%
0.8%
18
Nature Computational Science
50 papers in training set
Top 2%
0.7%
19
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 7%
0.6%
20
Cell
370 papers in training set
Top 18%
0.6%
21
PLOS ONE
4510 papers in training set
Top 71%
0.6%
22
GigaScience
172 papers in training set
Top 4%
0.5%
23
Frontiers in Genetics
197 papers in training set
Top 12%
0.5%