Methodological and Clinical Validation of TholdStormDX v0.0.1: An Advanced Stochastic Engine for the Optimization of Thresholds and Multimarker Panels Applied to Oncology

Reinosa, R.

2026-04-27 oncology
10.64898/2026.04.24.26351692 medRxiv

Introduction: Translating biomarkers into binary clinical decisions requires precise cut-off points. This study validates TholdStormDX v0.0.1, a mathematical engine that combines Dual Annealing, 2- and 4-parameter logistic fitting, and vectorized Monte Carlo simulations to optimize panels under Boolean OR logic.

Methods: The tool was evaluated on datasets from four diagnostic domains (Pulmonary Nodules, Hepatocellular Carcinoma [HCC], Cervical Cancer, and Breast Cancer) and one prognosis-oriented analytical context (Breast Cancer). Validation followed a strict workflow: the best individual and combined thresholds were characterized and selected on the Training (Train) and Validation (Val) sets, while the Test set was kept completely independent and used solely to assess the model's performance and generalizability.

Results: The tool enabled precise derivation of cut-off points for both individual biomarkers and multivariable combinations. Evaluation on the Test set showed objectively in which scenarios a single biomarker outperforms a complex panel, promoting clinical parsimony. For example, in Breast Cancer diagnosis an individual predictor outperformed the optimized panel (Sensitivity: 0.953 / Specificity: 0.952 in Test); conversely, in Hepatocellular Carcinoma the multivariable combination outperformed the single marker (Sensitivity: 0.707 / Specificity: 0.718 in Test). In addition, the self-auditing system flagged metric degradation when noisy variables were included, alerting the user to unstable results.

Conclusion: TholdStormDX v0.0.1 proves to be a robust and transparent bioinformatics platform for deriving clinical thresholds. Its main contribution lies in mitigating local minima and promoting clinical parsimony, enabling researchers to identify objectively when a single biomarker is sufficient and when a panel provides real added value. Furthermore, it turns the problem of biological noise into a safety feature: by systematically warning about algorithmic instability, it prevents overfitting and ensures the clinical viability of medical decisions.

Availability: The software is free and distributed under the GNU GPLv3 license. TholdStormDX v0.0.1 is written in Python, and its source code is available on GitHub at https://github.com/roberto117343/TholdStormDX.
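
The Boolean OR panel logic and the Train/Test separation described in the abstract can be illustrated with a minimal sketch. It is not taken from the TholdStormDX source: the function names, the toy data, and the candidate cut-offs are all hypothetical, and an exhaustive grid search maximizing Youden's J stands in for the Dual Annealing and Monte Carlo machinery the tool actually uses.

```python
from itertools import product

def or_panel_predict(rows, thresholds):
    """Flag a sample as positive if ANY marker meets or exceeds its cut-off (Boolean OR)."""
    return [any(value >= cut for value, cut in zip(row, thresholds)) for row in rows]

def sens_spec(predicted, labels):
    """Return (sensitivity, specificity) for boolean predictions and 0/1 labels."""
    tp = sum(p and y == 1 for p, y in zip(predicted, labels))
    tn = sum((not p) and y == 0 for p, y in zip(predicted, labels))
    pos = sum(y == 1 for y in labels)
    neg = len(labels) - pos
    return tp / pos, tn / neg

def best_or_thresholds(rows, labels, candidates):
    """Try every combination of candidate cut-offs; keep the one with the highest
    Youden's J = sensitivity + specificity - 1 (assumed objective, for illustration)."""
    best_j, best_cuts = -1.0, None
    for cuts in product(*candidates):
        sens, spec = sens_spec(or_panel_predict(rows, cuts), labels)
        j = sens + spec - 1
        if j > best_j:
            best_j, best_cuts = j, cuts
    return best_cuts, best_j

# Hypothetical toy data: two markers per sample, label 1 = disease present.
train_rows = [(0.2, 1.0), (0.3, 1.2), (0.9, 1.1), (0.4, 3.0), (0.8, 2.9), (0.1, 0.9)]
train_labels = [0, 0, 1, 1, 1, 0]
candidates = [(0.5, 0.85, 1.0), (1.5, 2.5, 3.5)]  # hypothetical cut-offs per marker

# Thresholds are chosen on the training data only.
cuts, j = best_or_thresholds(train_rows, train_labels, candidates)

# The held-out test data is used once, only to measure performance.
test_rows = [(0.6, 1.0), (0.2, 1.6), (0.3, 2.0), (0.1, 1.0)]
test_labels = [1, 0, 1, 0]
test_sens, test_spec = sens_spec(or_panel_predict(test_rows, cuts), test_labels)
print(cuts, j, test_sens, test_spec)
```

Under an OR rule, adding a marker can only keep or raise sensitivity and keep or lower specificity, which is why a single well-chosen marker can outperform a panel once a noisy variable is included.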

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

Rank  Journal                                                Papers in training set  Percentile  Probability
1     BMC Bioinformatics                                     383                     Top 0.6%    13.9%
2     JCO Clinical Cancer Informatics                        18                      Top 0.1%    8.2%
3     PeerJ                                                  261                     Top 0.5%    6.6%
4     Biology Methods and Protocols                          53                      Top 0.1%    6.1%
5     Computers in Biology and Medicine                      120                     Top 0.4%    6.1%
6     Frontiers in Bioinformatics                            45                      Top 0.1%    4.2%
7     PLOS ONE                                               4510                    Top 37%     3.8%
8     PLOS Computational Biology                             1633                    Top 9%      3.8%
----- 50% of probability mass above -----
9     Artificial Intelligence in Medicine                    15                      Top 0.1%    3.7%
10    Scientific Reports                                     3102                    Top 38%     3.6%
11    GigaScience                                            172                     Top 0.6%    3.5%
12    Cancers                                                200                     Top 2%      2.8%
13    NAR Genomics and Bioinformatics                        214                     Top 1.0%    2.8%
14    Briefings in Bioinformatics                            326                     Top 3%      2.4%
15    Computational and Structural Biotechnology Journal     216                     Top 3%      2.0%
16    Frontiers in Oncology                                  95                      Top 2%      1.8%
17    International Journal of Molecular Sciences            453                     Top 8%      1.6%
18    Annals of Biomedical Engineering                       34                      Top 0.8%    1.4%
19    Computer Methods and Programs in Biomedicine           27                      Top 0.6%    1.2%
20    Metabolites                                            50                      Top 0.9%    0.9%
21    Journal of Translational Medicine                      46                      Top 2%      0.9%
22    JCO Precision Oncology                                 14                      Top 0.3%    0.9%
23    iScience                                               1063                    Top 28%     0.9%
24    Frontiers in Genetics                                  197                     Top 8%      0.9%
25    European Journal of Cancer                             10                      Top 0.6%    0.7%
26    Life                                                   27                      Top 0.7%    0.6%
27    BMC Cancer                                             52                      Top 3%      0.6%
28    Journal of the American Society for Mass Spectrometry  33                      Top 0.6%    0.6%
29    Journal of Medical Internet Research                   85                      Top 5%      0.6%
30    Interface Focus                                        14                      Top 0.4%    0.6%