Back

Deep Learning Based Models for Preimplantation Mouse and Human Development

Proks, M.; Salehin, N.; Brickman, J. M.

2024-02-16 developmental biology
10.1101/2024.02.16.580649 bioRxiv
Show abstract

The rapid growth of single-cell transcriptomic technology has produced an increasing number of datasets for both embryonic development and in vitro pluripotent stem cell derived models. This avalanche of data about pluripotency and the process of lineage specification has meant it has become increasingly difficult to define specific cell types or states and compare these to in vitro differentiation. Here we utilize a set of deep learning (DL) tools to integrate and classify multiple datasets. This allows for the definition of both mouse and human embryo cell types, lineages and states, thereby maximising the information one can garner from these precious experimental resources. Our approaches are built on recent initiatives for large scale human organ atlases, but here we focus on the difficult to obtain and process material that spans early mouse, and in particular, human development. Using publicly available data for these stages, we test different deep learning approaches and develop both a model to classify cell types in an unbiased fashion and define the set of genes required to identify lineages, cell types and states. We have used our predictions to probe pluripotent stem cell models for both mouse and human development, showcasing the importance of this resource as a dynamic reference for early embryogenesis.

Matching journals

The top 2 journals account for 50% of the predicted probability mass.

1
Development
440 papers in training set
Top 0.1%
41.5%
2
Developmental Biology
134 papers in training set
Top 0.4%
9.6%
50% of probability mass above
3
Scientific Reports
3102 papers in training set
Top 15%
6.7%
4
PLOS ONE
4510 papers in training set
Top 37%
3.8%
5
iScience
1063 papers in training set
Top 6%
3.2%
6
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 2%
2.9%
7
Frontiers in Physiology
93 papers in training set
Top 2%
2.0%
8
Biology Open
130 papers in training set
Top 0.7%
2.0%
9
Bioinformatics
1061 papers in training set
Top 7%
1.8%
10
Stem Cell Reports
118 papers in training set
Top 0.4%
1.8%
11
PLOS Computational Biology
1633 papers in training set
Top 17%
1.6%
12
Scientific Data
174 papers in training set
Top 1%
1.3%
13
Nature Communications
4913 papers in training set
Top 57%
1.2%
14
Frontiers in Genetics
197 papers in training set
Top 8%
0.9%
15
BMC Bioinformatics
383 papers in training set
Top 6%
0.8%
16
PLOS Biology
408 papers in training set
Top 18%
0.8%
17
BMC Methods
11 papers in training set
Top 0.2%
0.8%
18
Journal of Computational Biology
37 papers in training set
Top 0.6%
0.8%
19
Genome Biology
555 papers in training set
Top 7%
0.8%
20
Briefings in Bioinformatics
326 papers in training set
Top 7%
0.8%
21
Nature Methods
336 papers in training set
Top 6%
0.8%
22
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.8%
23
Nature Machine Intelligence
61 papers in training set
Top 3%
0.8%
24
Cell Genomics
162 papers in training set
Top 6%
0.8%
25
Cell Reports
1338 papers in training set
Top 35%
0.7%
26
Developmental Dynamics
50 papers in training set
Top 0.8%
0.7%
27
Cytometry Part A
30 papers in training set
Top 0.4%
0.5%
28
Biology Methods and Protocols
53 papers in training set
Top 3%
0.5%
29
Cell Reports Methods
141 papers in training set
Top 6%
0.5%