Back

Global cell-state and gene-program representations reveal conserved and context-specific perturbation responses of cells

Pan, X.; Saunders, R.; Replogle, J. M.; Weissman, J. S.; Zhuang, X.

2026-05-18 systems biology
10.64898/2026.05.16.725005 bioRxiv
Show abstract

Understanding how cell states change in response to genetic perturbations is critical for gene-function and therapeutics discovery. However, state-of-the-art deep-learning models trained on large single-cell omics datasets still struggle to accurately predict cellular responses to perturbations, highlighting the need for a better understanding of the cell-state space and how cells move through this space. Here, we present a contrastive learning model that integrates diverse scRNA-seq datasets into a global, interpretable cell-state manifold. We further develop a framework to integrate this global cell-state manifold with genome-scale perturbation data to identify gene-expression programs that define principal axes of cell-state transitions and functional embeddings of genes that define major perturbation classes. Applying this framework across Perturb-seq datasets on different cell types reveals conserved cellular responses to perturbations, as well as cell-type-specific rewiring of stress responses. Moreover, we perform a genome-scale Perturb-seq screen in human embryonic stem cells, validating and extending these findings and uncovering a class of mesenchymal transitions induced by diverse perturbations to cellular stress-response pathways.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Cell Systems
167 papers in training set
Top 0.2%
22.7%
2
Nature Communications
4913 papers in training set
Top 13%
12.6%
3
Nature Biotechnology
147 papers in training set
Top 2%
4.9%
4
Cell Reports
1338 papers in training set
Top 14%
3.6%
5
Nature Methods
336 papers in training set
Top 3%
3.6%
6
Genome Biology
555 papers in training set
Top 2%
3.6%
50% of probability mass above
7
Nature
575 papers in training set
Top 7%
3.6%
8
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 20%
3.6%
9
Nature Genetics
240 papers in training set
Top 2%
3.6%
10
Science Advances
1098 papers in training set
Top 10%
2.6%
11
Genome Medicine
154 papers in training set
Top 3%
2.1%
12
Nature Machine Intelligence
61 papers in training set
Top 2%
1.9%
13
Cancer Cell
38 papers in training set
Top 1.0%
1.7%
14
Science
429 papers in training set
Top 14%
1.7%
15
Communications Biology
886 papers in training set
Top 8%
1.7%
16
Cell
370 papers in training set
Top 11%
1.7%
17
Advanced Science
249 papers in training set
Top 11%
1.7%
18
Nucleic Acids Research
1128 papers in training set
Top 12%
1.5%
19
Nature Medicine
117 papers in training set
Top 3%
1.3%
20
Cancer Discovery
61 papers in training set
Top 1%
1.2%
21
Molecular Systems Biology
142 papers in training set
Top 1%
0.9%
22
Nature Cell Biology
99 papers in training set
Top 4%
0.9%
23
Nature Microbiology
133 papers in training set
Top 4%
0.9%
24
iScience
1063 papers in training set
Top 29%
0.8%
25
Nature Computational Science
50 papers in training set
Top 2%
0.8%
26
PLOS Computational Biology
1633 papers in training set
Top 24%
0.8%
27
Nature Neuroscience
216 papers in training set
Top 6%
0.8%
28
Cell Reports Medicine
140 papers in training set
Top 8%
0.7%
29
Cell Metabolism
49 papers in training set
Top 3%
0.7%
30
Cell Reports Methods
141 papers in training set
Top 6%
0.6%