scVIP: personalized modeling of single-cell transcriptomes for developmental and disease phenotypes

Lai, H.-Y.; Yoo, Y.; Tjaernberg, A.; Travaglini, K. J.; Agrawal, A.; Kana, O.; van Velthoven, C.; Carroll, J. B.; Qiao, Q.; Mukherjee, S.; Fardo, D. W.; Lein, E.; Gabitto, M. I.

bioRxiv, bioinformatics, 2026-04-22
DOI: 10.64898/2026.04.20.717759
Single-cell RNA sequencing reveals cellular heterogeneity, but linking cellular states to individual-level phenotypes remains challenging. We present scVIP, a generative framework that integrates transcriptional profiles and phenotypic markers to learn personalized individual-level embeddings using generative models and cell-type-aware multi-instance learning. scVIP predicts developmental age, disease progression, and neuropathology, while harmonizing datasets with distinct phenotype definitions. The model highlights disease-relevant cell populations and transcriptional programs underlying neurodegeneration.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1. Nature Communications: 14.0% (4913 papers in training set, Top 12%)
2. Nature Methods: 8.2% (336 papers in training set, Top 1%)
3. Nature Genetics: 8.2% (240 papers in training set, Top 0.9%)
4. Nature: 6.2% (575 papers in training set, Top 5%)
5. Science: 6.2% (429 papers in training set, Top 5%)
6. Nature Machine Intelligence: 6.2% (61 papers in training set, Top 0.5%)
7. Cell Systems: 4.7% (167 papers in training set, Top 3%)

50% of probability mass above

8. Genome Biology: 4.7% (555 papers in training set, Top 2%)
9. Nature Biotechnology: 4.1% (147 papers in training set, Top 2%)
10. Genome Medicine: 3.9% (154 papers in training set, Top 2%)
11. Nature Medicine: 2.5% (117 papers in training set, Top 1%)
12. Advanced Science: 2.0% (249 papers in training set, Top 9%)
13. Science Advances: 2.0% (1098 papers in training set, Top 13%)
14. Nature Cell Biology: 2.0% (99 papers in training set, Top 2%)
15. Nucleic Acids Research: 1.7% (1128 papers in training set, Top 10%)
16. Proceedings of the National Academy of Sciences: 1.7% (2130 papers in training set, Top 33%)
17. Genome Research: 1.6% (409 papers in training set, Top 2%)
18. Nature Biomedical Engineering: 1.6% (42 papers in training set, Top 0.9%)
19. Cell Reports: 1.3% (1338 papers in training set, Top 27%)
20. Cell Genomics: 1.2% (162 papers in training set, Top 5%)
21. The American Journal of Human Genetics: 1.2% (206 papers in training set, Top 3%)
22. Cell: 0.9% (370 papers in training set, Top 16%)
23. Scientific Reports: 0.9% (3102 papers in training set, Top 71%)
24. PLOS Computational Biology: 0.8% (1633 papers in training set, Top 24%)
25. Cell Reports Medicine: 0.7% (140 papers in training set, Top 8%)
26. Bioinformatics: 0.7% (1061 papers in training set, Top 10%)
27. Briefings in Bioinformatics: 0.6% (326 papers in training set, Top 8%)
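
The statement that the top 7 journals account for 50% of the predicted probability mass can be checked by accumulating the listed probabilities in rank order. The sketch below does this in Python. The function name `journals_to_reach` is hypothetical and chosen for illustration, and the input list holds only the first ten percentages copied from the listing. It is not the site's own computation.

```python
from itertools import accumulate

# Predicted probabilities (in percent) for ranks 1-10, copied from the listing.
probs = [14.0, 8.2, 8.2, 6.2, 6.2, 6.2, 4.7, 4.7, 4.1, 3.9]


def journals_to_reach(probabilities, threshold):
    """Return the smallest number of top-ranked entries whose running sum
    reaches the threshold, or None if the threshold is never reached.

    Hypothetical helper for illustration only.
    """
    for rank, running_total in enumerate(accumulate(probabilities), start=1):
        if running_total >= threshold:
            return rank
    return None


cutoff = journals_to_reach(probs, 50.0)
print(cutoff)  # 7: ranks 1-6 sum to about 49.0%, and rank 7 brings the total to about 53.7%
```

Ranks 1 through 6 fall just short of 50%, which is why the cutoff sits after the seventh journal.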