Model for Ki-67 Proliferation Index Predicton, Trained End-to-End on Routine Diagnostic Data

Kukucka, A.; Obdrzalek, J.; Musil, V.; Nenutil, R.; Holub, P.; Brazdil, T.

2025-06-26 pathology

10.1101/2025.06.26.25330333 medRxiv

Show abstract

For breast cancer, the Ki-67 index gives important information on the patients prognosis and may predict the response to therapy. However, semi-automated methods for Ki-67 index calculation are prone to intra-and inter-observer variability, while fully automated machine learning models based on nuclei segmentation, classification and counting require training on large datasets with precise annotations down to the level of individual nuclei, which are hard to obtain. We design a neural network that straightforwardly predicts the Ki-67 index from scans of H&DAB-stained tissue samples. The network is trained only on existing data from daily operations at Masaryk Memorial Cancer Institute, Brno. The image labels contain only the Ki-67 index without any tumour epithelium or nuclei annotations. We use a simple convolutional neural network, not biasing the network by incorporation of layers dedicated to epithelium or nuclei segmentation or classification. Our models predictions align with the state-of-the-art evaluation by pathologists using QuPath image analysis with manual tumour annotation. On a test set consisting of 1250 images, the model achieved the mean absolute error of 3.668 and Pearsons correlation coefficient of 0.959 (p < 0.001). Surprisingly, despite using a simple architecture and very weak supervision, the model persuasively detects complex morphological structures such as tumour epithelium. The model also works on Whole Slide Image data, e.g. to detect the hotspot areas. Since our approach does not need any specifically labelled data or additional staining, it is cost-effective and allows easy domain adaptation.

Model for Ki-67 Proliferation Index Predicton, Trained End-to-End on Routine Diagnostic Data

Matching journals