Integrating Quantitative Histology with Clinical Data Improves Prediction of Cervical Intraepithelial Neoplasia Regression

Lehtonen, O.; Nordlund, N.; Kahelin, E.; Bergqvist, L.; Aro, K.; Hautaniemi, S.; Kalliala, I.; Virtanen, A.

2026-01-22 obstetrics and gynecology
10.64898/2026.01.21.26344510 medRxiv

Cervical intraepithelial neoplasia grade 2 (CIN2) lesions show variable outcomes, and accurate prediction of regression remains a major clinical challenge. We developed an interpretable machine learning pipeline that integrates quantitative histological, clinical, and human papillomavirus (HPV) genotyping data to predict lesion regression within one and two years. Using panoptic segmentation of routine hematoxylin and eosin (H&E)-stained biopsies, we extracted human-interpretable morphological and immune cell infiltration-related features that capture the key histopathological characteristics of CIN2 and identified features that predicted lesion regression. Further, integrating these features with predictive clinical features achieved higher predictive accuracy than clinical variables alone. These findings demonstrate that quantitative, interpretable analysis of H&E histology from routine diagnostic biopsies captures relevant information that predicts the natural history of CIN2 lesions.

[Graphical abstract: Figure 1. Created in BioRender. Lehtonen, O. (2026) https://BioRender.com/rlnkbkp]

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1. Computational and Structural Biotechnology Journal | 216 papers in training set | Top 0.1% | 24.0%
2. Clinical Cancer Research | 58 papers in training set | Top 0.1% | 9.0%
3. Scientific Reports | 3102 papers in training set | Top 11% | 7.6%
4. iScience | 1063 papers in training set | Top 2% | 4.6%
5. Heliyon | 146 papers in training set | Top 0.2% | 4.2%
6. npj Digital Medicine | 97 papers in training set | Top 1% | 3.8%
50% of probability mass above
7. Modern Pathology | 21 papers in training set | Top 0.1% | 3.8%
8. Clinical Chemistry | 22 papers in training set | Top 0.2% | 3.3%
9. eLife | 5422 papers in training set | Top 32% | 2.6%
10. Nature Communications | 4913 papers in training set | Top 48% | 2.0%
11. PLOS ONE | 4510 papers in training set | Top 49% | 2.0%
12. BMC Biology | 248 papers in training set | Top 1.0% | 1.8%
13. Journal of Pathology Informatics | 13 papers in training set | Top 0.2% | 1.8%
14. The Lancet Digital Health | 25 papers in training set | Top 0.5% | 1.6%
15. Cell Reports Medicine | 140 papers in training set | Top 4% | 1.6%
16. Patterns | 70 papers in training set | Top 1% | 1.4%
17. Communications Medicine | 85 papers in training set | Top 0.5% | 1.2%
18. Cancers | 200 papers in training set | Top 4% | 1.0%
19. Bioinformatics | 1061 papers in training set | Top 8% | 1.0%
20. Frontiers in Medicine | 113 papers in training set | Top 5% | 1.0%
21. The Journal of Pathology | 22 papers in training set | Top 0.3% | 0.9%
22. PLOS Computational Biology | 1633 papers in training set | Top 21% | 0.9%
23. Communications Biology | 886 papers in training set | Top 21% | 0.8%
24. npj Precision Oncology | 48 papers in training set | Top 1% | 0.8%
25. Frontiers in Bioinformatics | 45 papers in training set | Top 0.7% | 0.8%
26. Nature Medicine | 117 papers in training set | Top 4% | 0.8%
27. Genomics, Proteomics & Bioinformatics | 171 papers in training set | Top 7% | 0.7%
28. Breast Cancer Research | 32 papers in training set | Top 0.6% | 0.7%
29. BMC Medicine | 163 papers in training set | Top 8% | 0.7%
30. Journal of Medical Imaging | 11 papers in training set | Top 0.4% | 0.7%
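
The 50% cutoff marked in the list above can be checked with a short cumulative sum. The sketch below is only an illustration: it assumes the cutoff is the first rank at which the running total of the listed probabilities reaches 50%, which the page does not state explicitly, and the function name `journals_to_reach` is hypothetical.

```python
from itertools import accumulate

# Predicted probabilities (percent) of the ten highest-ranked journals,
# copied in rank order from the list above.
probs = [24.0, 9.0, 7.6, 4.6, 4.2, 3.8, 3.8, 3.3, 2.6, 2.0]


def journals_to_reach(probabilities, threshold):
    """Return how many top-ranked entries are needed for the running
    total to reach the threshold, or None if it is never reached."""
    for count, total in enumerate(accumulate(probabilities), start=1):
        if total >= threshold:
            return count
    return None


top_n = journals_to_reach(probs, 50.0)
print(top_n)  # 6, matching "The top 6 journals account for 50%"
```

The first five entries sum to 49.4%, just short of the threshold, and the sixth brings the total to 53.2%.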