Back

found: Inferring cell-level perturbation from structured label noise in single-cell data

Afanasiev, E.; Goeva, A.

2026-04-14 bioinformatics
10.64898/2026.04.10.717768 bioRxiv
Show abstract

Recent work by Goeva et al. introduced HiDDEN, a method for refining batch-level labels to infer cell-level perturbation without prior knowledge of affected populations, addressing the mismatch between sample-level labels and heterogeneous perturbation effects across cells. Here, we present found, a Python and R implementation of HiDDEN, supporting pipeline customization, by-factor grouping, hyperparameter selection, and visualization. Through benchmarking across diverse datasets, we show that performance depends strongly on modeling choices, particularly regression, grouping, and embedding dimensionality. found provides a practical, flexible, and accessible framework for robust cell-level perturbation analysis.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 9%
16.9%
2
Nature Methods
336 papers in training set
Top 0.8%
11.9%
3
Genome Biology
555 papers in training set
Top 0.3%
11.9%
4
Cell Systems
167 papers in training set
Top 2%
8.1%
5
Nature Genetics
240 papers in training set
Top 1%
6.1%
50% of probability mass above
6
Briefings in Bioinformatics
326 papers in training set
Top 1%
4.7%
7
Nature Biotechnology
147 papers in training set
Top 2%
4.7%
8
Bioinformatics
1061 papers in training set
Top 5%
3.8%
9
Genome Research
409 papers in training set
Top 1%
3.1%
10
PLOS Computational Biology
1633 papers in training set
Top 12%
2.6%
11
Nucleic Acids Research
1128 papers in training set
Top 9%
2.0%
12
The American Journal of Human Genetics
206 papers in training set
Top 2%
1.6%
13
Genome Medicine
154 papers in training set
Top 5%
1.6%
14
Nature
575 papers in training set
Top 13%
1.3%
15
BMC Bioinformatics
383 papers in training set
Top 6%
1.2%
16
Nature Computational Science
50 papers in training set
Top 1%
1.2%
17
Frontiers in Genetics
197 papers in training set
Top 8%
0.9%
18
PLOS ONE
4510 papers in training set
Top 65%
0.9%
19
Cell Genomics
162 papers in training set
Top 7%
0.7%
20
Molecular Systems Biology
142 papers in training set
Top 2%
0.7%
21
Communications Biology
886 papers in training set
Top 26%
0.7%
22
Patterns
70 papers in training set
Top 3%
0.7%
23
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 46%
0.7%
24
Nature Machine Intelligence
61 papers in training set
Top 4%
0.7%
25
Scientific Reports
3102 papers in training set
Top 79%
0.6%
26
Bioinformatics Advances
184 papers in training set
Top 5%
0.6%