Back

DeepHeme: A generalizable, bone marrow classifierwith hematopathologist-level performance

Goldgof, G.; Sun, S.; Cleaves, J.; Wang, L.; Lucas, F.; Brown, L.; Spectors, J.; Boiocchi, L.; Baik, J.; Zhu, M.; Ardon, O.; Lu, C.; Dogan, A.; Goldgof, D.; Carmichael, I.; Prakash, S.; Butte, A.

2023-02-21 bioinformatics
10.1101/2023.02.20.528987 bioRxiv
Show abstract

Morphology-based classification of cells in the bone marrow aspirate (BMA) is a key step in the diagnosis and management of hematologic malignancies. However, it is time-intensive and must be performed by expert hematopathologists and laboratory professionals. We curated a large, high-quality dataset of 41,595 hematopathologist consensus-annotated single-cell images extracted from BMA whole slide images (WSIs) containing 23 morphologic classes from the clinical archives of the University of California, San Francisco. We trained a convolutional neural network, DeepHeme, to classify images in this dataset, achieving a mean area under the curve (AUC) of 0.99. DeepHeme was then externally validated on WSIs from Memorial Sloan Kettering Cancer Center, with a similar AUC of 0.98, demonstrating robust generalization. When compared to individual hematopathologists from three different top academic medical centers, the algorithm outperformed all three. Finally, DeepHeme reliably identified cell states such as mitosis, paving the way for image-based quantification of mitotic index in a cell-specific manner, which may have important clinical applications.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Clinical Chemistry
22 papers in training set
Top 0.1%
17.6%
2
Cytometry Part A
30 papers in training set
Top 0.1%
14.5%
3
npj Precision Oncology
48 papers in training set
Top 0.1%
9.2%
4
Modern Pathology
21 papers in training set
Top 0.1%
4.9%
5
Scientific Reports
3102 papers in training set
Top 31%
4.0%
50% of probability mass above
6
Genome Medicine
154 papers in training set
Top 2%
3.6%
7
Nature Communications
4913 papers in training set
Top 42%
3.1%
8
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.3%
2.4%
9
PLOS Computational Biology
1633 papers in training set
Top 13%
2.4%
10
PLOS ONE
4510 papers in training set
Top 48%
2.1%
11
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 3%
1.9%
12
Briefings in Bioinformatics
326 papers in training set
Top 4%
1.7%
13
Bioinformatics
1061 papers in training set
Top 8%
1.3%
14
Communications Biology
886 papers in training set
Top 12%
1.3%
15
Cancer Research Communications
46 papers in training set
Top 0.8%
1.1%
16
Nature Machine Intelligence
61 papers in training set
Top 3%
1.0%
17
Journal of Pathology Informatics
13 papers in training set
Top 0.3%
1.0%
18
Cell Reports Medicine
140 papers in training set
Top 6%
0.9%
19
Cancers
200 papers in training set
Top 4%
0.8%
20
Leukemia
39 papers in training set
Top 0.7%
0.8%
21
Clinical Infectious Diseases
231 papers in training set
Top 5%
0.8%
22
BMC Medical Genomics
36 papers in training set
Top 1%
0.8%
23
BMC Bioinformatics
383 papers in training set
Top 7%
0.8%
24
Science Advances
1098 papers in training set
Top 31%
0.7%
25
Advanced Science
249 papers in training set
Top 21%
0.6%
26
Nucleic Acids Research
1128 papers in training set
Top 20%
0.6%
27
Ophthalmology Science
20 papers in training set
Top 0.3%
0.6%
28
Computers in Biology and Medicine
120 papers in training set
Top 5%
0.6%
29
Cell Reports Methods
141 papers in training set
Top 6%
0.6%
30
Small Methods
26 papers in training set
Top 1%
0.6%