Back

Algorithm-Based Model for Gastrointestinal and Liver Histopathological Analysis Using VGG16 and Specialized Stains: Statistical Validation of Thresholds in AI-Driven Digital Pathology

Adeluwoye, A. O.; Gbadegesin, M. O.; James, F. M.; Otegbade, P. S.; Alabetutu, A.

2026-04-11 pathology
10.64898/2026.04.08.26350456 medRxiv
Show abstract

Digital pathology, coupled with advanced image recognition algorithms, represents a transformative frontier in histopathological diagnosis. This sub-Saharan African laboratorys exploratory study investigates the application of a Convolutional Neural Network (CNN) model, specifically leveraging the VGG16 architecture with transfer learning, for automated analysis and classification of selected gastrointestinal (GIT) and liver tissue samples, incorporating both routine and specialized staining protocols. The study utilized a dataset comprising 114 samples (18 liver, 96 GIT images) derived from archival formalin-fixed paraffin-embedded tissue blocks at University College Hospital, Ibadan, Nigeria. Specialized staining techniques included Alcian Yellow for GIT mucin visualization and Massons Trichrome for liver fibrosis assessment, alongside conventional H&E staining. Model performance was evaluated using statistical methodologies including Wilson Score confidence intervals (CI), Bayesian probability assessment, and effect size analysis. Results reveal a striking dichotomy in model performance. The GIT tissue model achieved perfect classification accuracy (100% test accuracy) with exceptional statistical significance (Z=10.0, p<0.0001), Wilson CI [96.29%, 99.99%], Cohens h=1.571, and Bayesian probability >99.99%. Conversely, the liver tissue model demonstrated diagnostic failure (42.86% test accuracy), with Z=-1.428, p=0.9236, Wilson CI [33.59%, 52.65%], Cohens h=-0.144, and Bayesian probability of 7.64%. This performance divergence correlates with training data availability, as the liver dataset fell far below empirically established thresholds (>100-200 samples) for reliable classification. The liver models failure reveals limitations in transfer learning with insufficient data. These findings underscore critical implications for AI-enhanced digital pathology, demonstrating potential deployment of the GIT model as a promising one that supports tissue-specific model development.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Journal of Pathology Informatics
13 papers in training set
Top 0.1%
17.6%
2
PLOS ONE
4510 papers in training set
Top 13%
14.4%
3
Scientific Reports
3102 papers in training set
Top 18%
6.4%
4
The Journal of Pathology
22 papers in training set
Top 0.1%
6.3%
5
Biology Methods and Protocols
53 papers in training set
Top 0.1%
4.9%
6
Journal of Clinical Pathology
12 papers in training set
Top 0.1%
4.4%
50% of probability mass above
7
Modern Pathology
21 papers in training set
Top 0.1%
4.2%
8
Computers in Biology and Medicine
120 papers in training set
Top 0.8%
3.7%
9
Diagnostics
48 papers in training set
Top 0.5%
3.6%
10
Computational and Structural Biotechnology Journal
216 papers in training set
Top 3%
2.1%
11
Frontiers in Medicine
113 papers in training set
Top 3%
2.1%
12
Cureus
67 papers in training set
Top 2%
2.1%
13
British Journal of Cancer
42 papers in training set
Top 1.0%
1.5%
14
BMC Cancer
52 papers in training set
Top 2%
1.5%
15
GigaScience
172 papers in training set
Top 2%
1.3%
16
The American Journal of Tropical Medicine and Hygiene
60 papers in training set
Top 3%
1.3%
17
Nature Communications
4913 papers in training set
Top 56%
1.2%
18
Heliyon
146 papers in training set
Top 4%
1.0%
19
Cancers
200 papers in training set
Top 4%
1.0%
20
Clinical Chemistry
22 papers in training set
Top 0.7%
0.8%
21
Journal of Medical Imaging
11 papers in training set
Top 0.3%
0.8%
22
The American Journal of Pathology
31 papers in training set
Top 0.6%
0.7%
23
Malaria Journal
48 papers in training set
Top 2%
0.6%
24
Animals
20 papers in training set
Top 1%
0.6%
25
Viruses
318 papers in training set
Top 6%
0.6%