Back

Model for Ki-67 Proliferation Index Predicton, Trained End-to-End on Routine Diagnostic Data

Kukucka, A.; Obdrzalek, J.; Musil, V.; Nenutil, R.; Holub, P.; Brazdil, T.

2025-06-26 pathology
10.1101/2025.06.26.25330333 medRxiv
Show abstract

For breast cancer, the Ki-67 index gives important information on the patients prognosis and may predict the response to therapy. However, semi-automated methods for Ki-67 index calculation are prone to intra-and inter-observer variability, while fully automated machine learning models based on nuclei segmentation, classification and counting require training on large datasets with precise annotations down to the level of individual nuclei, which are hard to obtain. We design a neural network that straightforwardly predicts the Ki-67 index from scans of H&DAB-stained tissue samples. The network is trained only on existing data from daily operations at Masaryk Memorial Cancer Institute, Brno. The image labels contain only the Ki-67 index without any tumour epithelium or nuclei annotations. We use a simple convolutional neural network, not biasing the network by incorporation of layers dedicated to epithelium or nuclei segmentation or classification. Our models predictions align with the state-of-the-art evaluation by pathologists using QuPath image analysis with manual tumour annotation. On a test set consisting of 1250 images, the model achieved the mean absolute error of 3.668 and Pearsons correlation coefficient of 0.959 (p < 0.001). Surprisingly, despite using a simple architecture and very weak supervision, the model persuasively detects complex morphological structures such as tumour epithelium. The model also works on Whole Slide Image data, e.g. to detect the hotspot areas. Since our approach does not need any specifically labelled data or additional staining, it is cost-effective and allows easy domain adaptation.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Journal of Pathology Informatics
13 papers in training set
Top 0.1%
38.2%
2
Scientific Reports
3102 papers in training set
Top 9%
8.5%
3
Modern Pathology
21 papers in training set
Top 0.1%
6.4%
50% of probability mass above
4
PLOS ONE
4510 papers in training set
Top 26%
6.4%
5
Nature Communications
4913 papers in training set
Top 36%
4.2%
6
Cancers
200 papers in training set
Top 2%
3.6%
7
iScience
1063 papers in training set
Top 9%
2.4%
8
Breast Cancer Research
32 papers in training set
Top 0.3%
1.9%
9
PLOS Computational Biology
1633 papers in training set
Top 16%
1.7%
10
Journal of Medical Imaging
11 papers in training set
Top 0.1%
1.7%
11
npj Breast Cancer
18 papers in training set
Top 0.1%
1.5%
12
Medical Image Analysis
33 papers in training set
Top 0.7%
1.5%
13
Journal of Medical Internet Research
85 papers in training set
Top 3%
1.2%
14
npj Precision Oncology
48 papers in training set
Top 1.0%
1.0%
15
New Phytologist
309 papers in training set
Top 4%
1.0%
16
Communications Biology
886 papers in training set
Top 17%
1.0%
17
GigaScience
172 papers in training set
Top 2%
0.9%
18
Bioinformatics
1061 papers in training set
Top 9%
0.9%
19
Communications Medicine
85 papers in training set
Top 0.9%
0.8%
20
BMC Medical Informatics and Decision Making
39 papers in training set
Top 2%
0.8%
21
Laboratory Investigation
13 papers in training set
Top 0.3%
0.8%
22
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 6%
0.8%
23
Genome Biology
555 papers in training set
Top 7%
0.8%
24
JAMIA Open
37 papers in training set
Top 1%
0.8%
25
Cancer Research
116 papers in training set
Top 4%
0.7%
26
Biological Imaging
15 papers in training set
Top 0.3%
0.7%
27
JNCI Cancer Spectrum
10 papers in training set
Top 0.6%
0.7%
28
The Lancet Digital Health
25 papers in training set
Top 1%
0.7%
29
Biology Methods and Protocols
53 papers in training set
Top 4%
0.5%