Back

Integrated Framework for the Optimal Determination of Diagnostic Cut-off Points through Empirical Interpolation, Logistic Modeling Optimized by Dual Annealing, and Combinatorial Optimization with ThresholdXpert: Application to Hepatocellular Carcinoma

Reinosa, R.

2026-02-23 oncology
10.64898/2026.02.19.26346674 medRxiv
Show abstract

IntroductionThe precise determination of diagnostic cut-off points is essential for the development of multimarker panels in oncology. In previous work on pulmonary nodules, it was observed that the standard two-parameter logistic fit could be insufficient for biomarkers with asymmetric distributions. Furthermore, the calculation of empirical cut-off points based on graphical visualization presented limitations in precision and reproducibility. ObjectiveThis study presents a methodological advancement in the data analysis phase (Stage 1), introducing new Python algorithms for the direct analytical calculation of empirical intersections and robust mathematical modeling using Dual Annealing with both two-parameter and four-parameter logistic functions. This improved methodology feeds into the ThresholdXpert 1.0 software tool for combinatorial optimization of biomarker panels (Stage 2), and is applied here to the diagnostic challenge of hepatocellular carcinoma (HCC). MethodsThe methodology was first validated by re-analyzing a dataset of patients with pulmonary nodules (N=895). It was subsequently applied to an HCC dataset derived from the cohort of Jang et al. (208 HCC, 193 cirrhosis, 401 total), randomly divided into a training set (280) and an independent test set (121). Scripts were developed to compare the previous two-parameter logistic fit with the new two- and four-parameter logistic models. Finally, ThresholdXpert 1.0 was used for multimarker panel optimization. ResultsThe integration of empirical calculation, logistic modeling, and combinatorial optimization through ThresholdXpert 1.0 provides a robust and coherent framework for the development of multimarker diagnostic panels. The four-parameter logistic model provided additional validation without substantially modifying cut-off values for most biomarkers, confirming the stability of the approach while offering greater flexibility for complex distributions. When applied to hepatocellular carcinoma, the framework identified a molecular panel composed of AFP, PIVKA-II, OPN, and DKK-1 with sensitivity of 0.77 and specificity of 0.72, and an optimized panel incorporating inverse MELD that achieved the best overall balance (sensitivity 0.73, specificity 0.75) in independent external validation. These results demonstrate the potential of this approach as a generalizable tool for the optimized design of binary diagnostic systems in oncology. ConclusionThe integration of complementary mathematical modeling enhances the capability of ThresholdXpert 1.0 to identify robust diagnostic panels, as in some cases a single biomarker may outperform biomarker combinations, and vice versa. This approach enabled the integration of molecular biomarkers and clinical variables under a unified mathematical framework. Contactroberto117343@gmail.com

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Biology Methods and Protocols
53 papers in training set
Top 0.1%
12.2%
2
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.1%
7.1%
3
BMC Bioinformatics
383 papers in training set
Top 1%
6.7%
4
PLOS ONE
4510 papers in training set
Top 28%
6.3%
5
Frontiers in Oncology
95 papers in training set
Top 0.6%
6.3%
6
Cancers
200 papers in training set
Top 1%
4.3%
7
PLOS Computational Biology
1633 papers in training set
Top 8%
4.1%
8
BMC Cancer
52 papers in training set
Top 0.7%
3.6%
50% of probability mass above
9
Scientific Reports
3102 papers in training set
Top 38%
3.6%
10
British Journal of Cancer
42 papers in training set
Top 0.5%
3.0%
11
Frontiers in Bioinformatics
45 papers in training set
Top 0.1%
3.0%
12
Computers in Biology and Medicine
120 papers in training set
Top 1%
2.7%
13
Diagnostics
48 papers in training set
Top 0.6%
2.6%
14
PeerJ
261 papers in training set
Top 5%
2.1%
15
Computational and Structural Biotechnology Journal
216 papers in training set
Top 4%
1.9%
16
Clinical Chemistry
22 papers in training set
Top 0.3%
1.8%
17
Cancer Medicine
24 papers in training set
Top 0.8%
1.7%
18
Artificial Intelligence in Medicine
15 papers in training set
Top 0.3%
1.6%
19
JCO Precision Oncology
14 papers in training set
Top 0.2%
1.6%
20
npj Precision Oncology
48 papers in training set
Top 0.8%
1.3%
21
FEBS Open Bio
29 papers in training set
Top 0.3%
1.2%
22
Metabolites
50 papers in training set
Top 0.7%
1.2%
23
Annals of Biomedical Engineering
34 papers in training set
Top 0.9%
1.2%
24
BMC Medical Informatics and Decision Making
39 papers in training set
Top 2%
1.1%
25
Bioinformatics
1061 papers in training set
Top 9%
0.8%
26
JMIR Medical Informatics
17 papers in training set
Top 2%
0.7%
27
Journal of Magnetic Resonance Imaging
14 papers in training set
Top 0.6%
0.7%
28
Journal of Medical Internet Research
85 papers in training set
Top 5%
0.7%
29
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.7%
30
Journal of Translational Medicine
46 papers in training set
Top 3%
0.7%