Back

A Cross-Study Multi-Organ Cell Atlas ofMacaca fascicularis Informed by Human Foundation Model Annotation: A Resource for Translational Target Assessment

Souza, T. M.; Gamse, J. T.; Moreno, L.; van Rumpt, M.; Nunez-Moreno, G.; Khatri, I.; van Asten, S. D.; Khusial, N. V.; Baltasar-Perez, E.; Adhav, R.; Abdelaal, T.; Wojtuszkiewicz, A.; Calis, J. J. A.; Csala, A.; Dahlman, A.; Fuller, C. L.; Thalhauser, C. J.; Kolder, I. C. R. M.

2026-03-19 bioinformatics
10.64898/2026.03.17.711997 bioRxiv
Show abstract

Non-human primates (NHPs), particularly Macaca fascicularis (cynomolgus macaque), represent an essential model for preclinical assessment of biologics due to their high genetic and physiological similarity to humans. However, mounting regulatory pressure to reduce NHP use and the lack of a unified, well-annotated single-cell atlas currently limits both target qualification and mechanistic interpretation of toxicity in this species. To address this gap, we assembled and harmonized the largest single-cell transcriptomic atlas of M. fascicularis to date, integrating 30 publicly available studies spanning 57 anatomical regions, 43 organs and 14 physiological systems. We implemented a scalable framework for cross-species cell type annotation by embedding both cynomolgus monkeys and human (Tabula Sapiens V2) datasets into a shared reference space using Universal Cell Embeddings (UCE), enabling consistent harmonization of cell identities. In total, 27 organs were annotated using human reference labels, while the remaining sets retained author-provided annotations or labels transferred from other cynomolgus studies with available annotations. The resulting atlas comprises over 2.5 million cells and demonstrates concordance in cell-type-specific expression patterns between cynomolgus and humans, including tissue-specific markers and targets relevant for biologics development. Through translational use cases, we illustrate how this resource can be applied to assess target expression in tissues affected by concordant human-NHP toxicities, investigate ocular adverse events associated with antibody-drug conjugates (ADCs), and identify species-specific features of immune cell subtypes with known safety implications. By enabling scalable, high-resolution, cross-species comparisons of gene expression across organs, tissues, and cell states, this atlas supports improved target qualification, more mechanistic interpretation of toxicities, and evidence-based decisions on the relevance and design of NHP studies. Collectively, this work provides a unified cross-species single-cell resource for cynomolgus monkey and a modular computational framework that advances new approach methodologies and contributes to the refinement and reduction of NHP use in preclinical research.

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 13%
12.5%
2
Genome Medicine
154 papers in training set
Top 0.6%
8.9%
3
Advanced Science
249 papers in training set
Top 3%
6.3%
4
Scientific Data
174 papers in training set
Top 0.3%
4.9%
5
Communications Biology
886 papers in training set
Top 0.6%
4.9%
6
Cell Reports Medicine
140 papers in training set
Top 1%
4.0%
7
Cell Reports
1338 papers in training set
Top 14%
3.7%
8
Nature Biotechnology
147 papers in training set
Top 2%
3.6%
9
Cell Reports Methods
141 papers in training set
Top 1%
3.3%
50% of probability mass above
10
Briefings in Bioinformatics
326 papers in training set
Top 2%
3.1%
11
Science Advances
1098 papers in training set
Top 10%
2.6%
12
Nature Biomedical Engineering
42 papers in training set
Top 0.4%
2.6%
13
Scientific Reports
3102 papers in training set
Top 53%
1.9%
14
Nucleic Acids Research
1128 papers in training set
Top 9%
1.9%
15
JCI Insight
241 papers in training set
Top 3%
1.7%
16
mAbs
28 papers in training set
Top 0.2%
1.5%
17
Cell Genomics
162 papers in training set
Top 4%
1.5%
18
iScience
1063 papers in training set
Top 18%
1.5%
19
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 36%
1.3%
20
Cancer Cell
38 papers in training set
Top 1%
1.3%
21
Biomaterials
78 papers in training set
Top 0.8%
1.2%
22
Cancer Research Communications
46 papers in training set
Top 0.7%
1.2%
23
Bioinformatics Advances
184 papers in training set
Top 4%
1.2%
24
PLOS ONE
4510 papers in training set
Top 60%
1.2%
25
Frontiers in Immunology
586 papers in training set
Top 5%
1.2%
26
Cell Systems
167 papers in training set
Top 9%
1.1%
27
Bioinformatics
1061 papers in training set
Top 8%
1.0%
28
PLOS Computational Biology
1633 papers in training set
Top 21%
1.0%
29
Journal of Translational Medicine
46 papers in training set
Top 2%
0.8%
30
eLife
5422 papers in training set
Top 55%
0.8%