Back

Defining early human developmental identity: A curated, cross-platform marker framework

Rossi, A.; Dobner, J.; Prigione, A.

2026-03-30 developmental biology
10.64898/2026.03.26.714482 bioRxiv
Show abstract

Early human development involves dynamic transitions in cell identity, including transient transcriptional modulation and stable lineage commitment. Distinguishing these types of gene expression changes is challenging and can be further exacerbated by genetic and experimental heterogeneity in the context of human pluripotent stem cell (hPSC) research. To address this challenge and help uncover transcriptional changes indicative of true developmental state, we establish a curated, cross-platform marker framework for robust identification of pluripotency and early germ-layer identity. Starting from an unbiased RNA-seq discovery set, we systematically validate candidate markers across qPCR, bulk and single-cell RNA sequencing, and quantitative proteomics platforms, yielding a refined panel of 67 markers (20 for the undifferentiated state, 17 for endoderm, 15 for ectoderm, and 15 for mesoderm). We show that this framework reliably identifies early developmental states across heterogeneous datasets, generalizes to in vivo human embryo cell types, and preserves lineage identity despite substantial transcriptional variability. Furthermore, we demonstrate concordant protein-level expression for a subset of markers, supported by deep proteomic profiling of the reference line KOLF2.1J. To enable broad application, we introduce DeepDiff, a web-based resource integrating the validated markers, allowing automated fate classification in a user-friendly interface. Together, this work provides a standardized framework for defining early human developmental identity and disentangling lineage commitment from context-dependent modulation.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 10%
14.5%
2
Cell Reports Methods
141 papers in training set
Top 0.2%
9.9%
3
Development
440 papers in training set
Top 0.4%
6.3%
4
Developmental Cell
168 papers in training set
Top 4%
4.8%
5
Nature Biotechnology
147 papers in training set
Top 2%
4.8%
6
Scientific Reports
3102 papers in training set
Top 25%
4.8%
7
Cell Reports
1338 papers in training set
Top 11%
4.3%
8
Cell Systems
167 papers in training set
Top 3%
4.2%
50% of probability mass above
9
Nucleic Acids Research
1128 papers in training set
Top 5%
3.9%
10
Nature Cell Biology
99 papers in training set
Top 1%
3.5%
11
Cell Stem Cell
57 papers in training set
Top 0.8%
2.4%
12
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 26%
2.3%
13
Advanced Science
249 papers in training set
Top 9%
2.0%
14
Stem Cell Reports
118 papers in training set
Top 0.4%
1.9%
15
iScience
1063 papers in training set
Top 13%
1.8%
16
PLOS ONE
4510 papers in training set
Top 55%
1.7%
17
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 4%
1.7%
18
Cell
370 papers in training set
Top 11%
1.7%
19
Genome Biology
555 papers in training set
Top 5%
1.5%
20
Nature Neuroscience
216 papers in training set
Top 5%
1.3%
21
Molecular & Cellular Proteomics
158 papers in training set
Top 1%
1.2%
22
Nature Genetics
240 papers in training set
Top 6%
1.2%
23
Nature
575 papers in training set
Top 13%
1.2%
24
Communications Biology
886 papers in training set
Top 15%
1.2%
25
Science
429 papers in training set
Top 17%
1.1%
26
Science Translational Medicine
111 papers in training set
Top 5%
0.9%
27
npj Systems Biology and Applications
99 papers in training set
Top 2%
0.9%
28
eLife
5422 papers in training set
Top 56%
0.8%
29
Nature Methods
336 papers in training set
Top 6%
0.8%
30
Science Advances
1098 papers in training set
Top 32%
0.7%