Back

Machine-learning convergent melanocytic morphology despite noisy archival slides

Tada, M.; Gaskins, G.; Ghandian, S.; Mew, N.; Keiser, M. J.; Keiser, E. S.

2024-09-17 pathology
10.1101/2024.09.12.612732 bioRxiv
Show abstract

Melanocytic atypia, ranging from benign to malignant, often leads to diagnostic discordance, complicating its prediction by machine learning models. To overcome this, we paired H&E-stained histology images with contiguous or serial sections immunohistochemically (IHC) stained for melanocytic cells via antibodies for MelanA, MelPro, or SOX10. We developed a deep-learning pipeline to identify melanocytic atypia by digitizing a real-world archival dataset of 122 paired whole slide images from 61 confirmed melanoma in situ (MIS) cases at two institutions. Only 37.7% of the cases contained tissue pairs that matched well enough for deep learning. Nonetheless, the MelanA+MelPro models achieved an average area under the receiver-operating characteristic (AUROC) of 0.948 and an average area under the precision-recall curve (AUPRC) of 0.611, while the SOX10 models had an average of 0.867 AUROC and 0.433 AUPRC. Despite learning from biologically different IHC stains, the convolutional neural network (CNN) models independently exhibited an intuitive convergent rationale by explainable AI saliency calculations. Different antibodies, with nuclear versus cytoplasmic staining, provided complementary yet consistent information, which the CNNs integrated effectively. The resulting multi-antibody virtual stains identified morphologic cytologic and small-scale architectural features directly from H&E-stained histology images, which can assist pathologists in assessing cutaneous MIS.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Modern Pathology
21 papers in training set
Top 0.1%
29.7%
2
Journal of Pathology Informatics
13 papers in training set
Top 0.1%
15.8%
3
Nature Communications
4913 papers in training set
Top 31%
5.2%
50% of probability mass above
4
Laboratory Investigation
13 papers in training set
Top 0.1%
3.9%
5
Scientific Reports
3102 papers in training set
Top 32%
3.8%
6
The Journal of Pathology
22 papers in training set
Top 0.1%
2.2%
7
Cancers
200 papers in training set
Top 2%
2.2%
8
New Phytologist
309 papers in training set
Top 3%
2.2%
9
PLOS Computational Biology
1633 papers in training set
Top 15%
1.8%
10
PLOS ONE
4510 papers in training set
Top 51%
1.8%
11
Communications Biology
886 papers in training set
Top 7%
1.8%
12
npj Digital Medicine
97 papers in training set
Top 2%
1.4%
13
eBioMedicine
130 papers in training set
Top 2%
1.4%
14
npj Precision Oncology
48 papers in training set
Top 0.8%
1.3%
15
Clinical Chemistry
22 papers in training set
Top 0.6%
1.0%
16
eLife
5422 papers in training set
Top 51%
1.0%
17
Cancer Research
116 papers in training set
Top 3%
1.0%
18
The Journal of Molecular Diagnostics
36 papers in training set
Top 0.3%
1.0%
19
The American Journal of Pathology
31 papers in training set
Top 0.3%
1.0%
20
ACS Nano
99 papers in training set
Top 3%
0.8%
21
iScience
1063 papers in training set
Top 30%
0.8%
22
Neuropathology and Applied Neurobiology
14 papers in training set
Top 0.6%
0.8%
23
Diagnostics
48 papers in training set
Top 2%
0.7%
24
Science Translational Medicine
111 papers in training set
Top 7%
0.7%
25
Nature Machine Intelligence
61 papers in training set
Top 4%
0.7%
26
The Lancet Digital Health
25 papers in training set
Top 1%
0.7%
27
Journal of Medical Imaging
11 papers in training set
Top 0.4%
0.7%
28
European Journal of Cancer
10 papers in training set
Top 0.7%
0.5%
29
JNCI Cancer Spectrum
10 papers in training set
Top 0.6%
0.5%
30
GigaScience
172 papers in training set
Top 4%
0.5%