Back

Cross-Species Morphology Learning Enables Nucleic Acid-Independent Detection of Live Mutant Blood Cells

Khan, S. A.; Faerber, D.; Kirkey, D.; Stirewalt, D.; Raffel, S.; Hadland, B.; Deininger, M.; Buettner, F.; Zhao, H. G.

2026-03-25 pathology
10.1101/2025.10.20.682949 bioRxiv
Show abstract

In both neonates and adults, the presence of malignancy-associated mutations in peripheral blood (PB) correlates with an elevated risk of future neoplastic transformation, with certain mutations, such as KMT2A rearrangements, exhibiting near-complete penetrance. If feasible, pre-malignant screening could enable early intervention and even disease prevention. However, nucleic acid sequencing- and hybridization-based mutation detection have limited cost-efficiency, constraining their use in screening. Here, we introduce a computer vision platform that can identify mutant cells in fresh PB samples that carry KMT2A-MLLT3 (a frequent mutation in pediatric and adult leukemias and detectable in newborn blood samples) or JAK2-V617F (a frequent mutation in myeloproliferative neoplasms and clonal hematopoiesis). This is achieved by high-throughput single-cell imaging and mutation detection by machine learning (ML)-powered morphology recognition. The ML models were developed by cross-species learning of conserved features between mutant cells from mouse genetic models and from human samples, enabling a cost-effective approach for detecting mutations in live blood cells. This platform holds promise for pre-malignant screening in asymptomatic neonates and adults with KMT2A-MLLT3 or JAK2-V617F mutation and is potentially generalizable to the detection other malignancy-associate mutations. Our platform provides a novel single-cell morphological data modality that complements existing single-cell genomics.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 6%
18.2%
2
Modern Pathology
21 papers in training set
Top 0.1%
8.4%
3
Nature Methods
336 papers in training set
Top 2%
6.4%
4
Communications Biology
886 papers in training set
Top 0.4%
6.3%
5
Advanced Science
249 papers in training set
Top 3%
4.8%
6
eLife
5422 papers in training set
Top 22%
4.0%
7
Light: Science & Applications
16 papers in training set
Top 0.2%
2.9%
50% of probability mass above
8
ACS Nano
99 papers in training set
Top 2%
2.4%
9
Scientific Reports
3102 papers in training set
Top 48%
2.4%
10
Medical Image Analysis
33 papers in training set
Top 0.5%
2.1%
11
Cell Reports Medicine
140 papers in training set
Top 3%
1.9%
12
Journal of Pathology Informatics
13 papers in training set
Top 0.2%
1.7%
13
PLOS ONE
4510 papers in training set
Top 54%
1.7%
14
iScience
1063 papers in training set
Top 15%
1.7%
15
Genome Medicine
154 papers in training set
Top 5%
1.7%
16
EMBO Molecular Medicine
85 papers in training set
Top 2%
1.3%
17
Cell Reports
1338 papers in training set
Top 28%
1.2%
18
PLOS Computational Biology
1633 papers in training set
Top 20%
1.2%
19
Science Translational Medicine
111 papers in training set
Top 4%
1.1%
20
Clinical Chemistry
22 papers in training set
Top 0.6%
0.9%
21
Cell Reports Methods
141 papers in training set
Top 4%
0.9%
22
Nature Biotechnology
147 papers in training set
Top 6%
0.9%
23
eBioMedicine
130 papers in training set
Top 3%
0.9%
24
PLOS Biology
408 papers in training set
Top 17%
0.9%
25
New Phytologist
309 papers in training set
Top 4%
0.8%
26
Science Advances
1098 papers in training set
Top 28%
0.8%
27
Briefings in Bioinformatics
326 papers in training set
Top 7%
0.7%
28
National Science Review
22 papers in training set
Top 2%
0.7%
29
Cells
232 papers in training set
Top 7%
0.7%
30
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 46%
0.7%