Back

Cellector: A tool to detect foreign genotype cells in scRNAseq data with applications in leukemia and microchimerism.

Heaton, H.; Behboudi, R.; Ward, C.; Weerakoon, M.; Kanaan, S.; Reichle, S.; Hunter, N.; Furlan, S.

2026-03-30 bioinformatics
10.64898/2026.03.26.714571 bioRxiv
Show abstract

The existence of rare, genetically distinct cells can occur in various samples such as transplant patients, naturally occurring microchimerism between maternal and fetal tissues, and cancer samples with sufficient mutational burden. Computational methods for detecting these foreign cells are vital to studying these biological conditions. An application that is of particular interest is that of leukemia patients post hematopoietic cell transplant (HCT). In many leukemias, a primary therapy is HCT, after which, the primary genotype of the bone marrow and blood cells should be of donor origin. If cells exist that are of the patients genotype and the cell type lineage of the particular leukemia, this is known as measurable residual disease (MRD). If the MRD is high enough, this may represent a relapse of the patients leukemia. Furthermore, accurately estimating the MRD is important for driving clinical decision making for these patients. Here we present Cellector, a computational method for identifying rare foreign genotype cells in single cell RNAseq (scRNAseq) datasets. We show cellector accurately detects microchimeric cells down to an exceedingly low percentage of these cells present (0.05% or lower).

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 1.0%
23.2%
2
BMC Bioinformatics
383 papers in training set
Top 0.6%
12.7%
3
PLOS Computational Biology
1633 papers in training set
Top 7%
5.0%
4
Nucleic Acids Research
1128 papers in training set
Top 5%
4.1%
5
PLOS ONE
4510 papers in training set
Top 38%
3.7%
6
iScience
1063 papers in training set
Top 4%
3.7%
50% of probability mass above
7
Scientific Reports
3102 papers in training set
Top 34%
3.7%
8
Frontiers in Genetics
197 papers in training set
Top 2%
3.7%
9
Genome Biology
555 papers in training set
Top 2%
3.7%
10
Briefings in Bioinformatics
326 papers in training set
Top 2%
3.3%
11
Nature Communications
4913 papers in training set
Top 46%
2.1%
12
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 3%
1.7%
13
eLife
5422 papers in training set
Top 40%
1.7%
14
Communications Biology
886 papers in training set
Top 12%
1.4%
15
IEEE Transactions on Computational Biology and Bioinformatics
17 papers in training set
Top 0.3%
1.4%
16
Cytometry Part A
30 papers in training set
Top 0.2%
1.3%
17
Bioinformatics Advances
184 papers in training set
Top 4%
1.3%
18
BioData Mining
15 papers in training set
Top 0.5%
1.1%
19
BMC Medical Genomics
36 papers in training set
Top 0.8%
1.0%
20
Genome Medicine
154 papers in training set
Top 7%
0.8%
21
Cell Reports Methods
141 papers in training set
Top 4%
0.8%
22
Frontiers in Molecular Biosciences
100 papers in training set
Top 4%
0.8%
23
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.8%
24
Frontiers in Physiology
93 papers in training set
Top 6%
0.8%
25
Cancer Research Communications
46 papers in training set
Top 1%
0.7%
26
Genome Research
409 papers in training set
Top 4%
0.7%
27
BMC Genomics
328 papers in training set
Top 6%
0.7%
28
GigaScience
172 papers in training set
Top 4%
0.7%
29
Patterns
70 papers in training set
Top 3%
0.7%
30
Cell Systems
167 papers in training set
Top 14%
0.5%