Back

A Governance-Driven, Real-World Data-Calibrated Health Informatics Framework for Longitudinal Utilization Forecasting in Oncology and Complex Chronic Conditions

Dantuluri, A. V. S. R.; Kumar, S.

2026-02-26 health informatics
10.64898/2026.02.23.26346919 medRxiv
Show abstract

BackgroundHealthcare utilization forecasting systems are often derived from static, annualized market share assumptions that fail to represent real-world treatment dynamics. Such approaches systematically misestimate future utilization by ignoring longitudinal treatment sequencing, discontinuation with surveillance, recurrence-driven re-entry, and provider adoption dynamics. ObjectiveThis study proposes a reusable, governance-driven health informatics forecasting framework designed to generate realistic utilization forecasts using real-world data by integrating longitudinal patient-flow modeling, persistence-based exposure estimation, provider behavioral adoption, and multi-source calibration into a single architecture. MethodsLongitudinal U.S. administrative claims data representing oncology treatment populations (approximately 80,000 treated patients annually across therapy lines) were curated through a governance layer that refines diagnosis and treatment pools, infers clinically valid lines of therapy, and corrects for lookback-limited recurrence bias. Patients were modeled as transitioning across explicit clinical states, including treatment initiation, sequential therapy lines, discontinuation, surveillance, and recurrence-driven re-entry. Forecast outputs were calibrated using volume-weighted and behaviorally dampened provider adoption dynamics derived from primary research and claims-revealed utilization and evaluated against static share-based forecasts under identical peak-share assumptions. ResultsAcross multiple oncology contexts, longitudinal patient-flow-based forecasting recovered approximately 50-70% more cumulative treated months than static approaches. Underestimation in traditional models was driven primarily by failure to capture later-line persistence, surveillance exit, and re-treatment dynamics. Setting-specific calibration revealed earlier adoption in academic centers and slower, payer-constrained uptake in community practices. ConclusionsThe proposed framework demonstrates a forecast-oriented health informatics architecture that improves utilization estimation and decision support in complex, longitudinal care ecosystems. The methodology generalizes across tumor types and chronic conditions characterized by treatment sequencing, persistence variability, and relapse-driven re-entry.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.1%
22.7%
2
npj Digital Medicine
97 papers in training set
Top 0.3%
14.8%
3
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.1%
6.4%
4
Journal of Medical Internet Research
85 papers in training set
Top 1%
4.2%
5
JAMIA Open
37 papers in training set
Top 0.3%
4.0%
50% of probability mass above
6
JMIR Medical Informatics
17 papers in training set
Top 0.3%
4.0%
7
BMC Medical Informatics and Decision Making
39 papers in training set
Top 0.8%
3.6%
8
Nature Communications
4913 papers in training set
Top 42%
3.1%
9
JAMA Network Open
127 papers in training set
Top 1%
3.1%
10
Journal of Biomedical Informatics
45 papers in training set
Top 0.5%
3.1%
11
The Lancet Digital Health
25 papers in training set
Top 0.2%
2.4%
12
PLOS ONE
4510 papers in training set
Top 48%
2.1%
13
BMC Medical Research Methodology
43 papers in training set
Top 0.4%
2.1%
14
JMIR Public Health and Surveillance
45 papers in training set
Top 1%
1.9%
15
Annals of Internal Medicine
27 papers in training set
Top 0.3%
1.9%
16
BMJ Health & Care Informatics
13 papers in training set
Top 0.3%
1.9%
17
Scientific Reports
3102 papers in training set
Top 57%
1.7%
18
CMAJ Open
12 papers in training set
Top 0.1%
1.3%
19
Frontiers in Digital Health
20 papers in training set
Top 0.9%
1.2%
20
Frontiers in Public Health
140 papers in training set
Top 6%
1.2%
21
PLOS Computational Biology
1633 papers in training set
Top 23%
0.8%
22
JMIRx Med
31 papers in training set
Top 2%
0.8%
23
Biomedicines
66 papers in training set
Top 3%
0.7%
24
International Journal of Cancer
42 papers in training set
Top 1%
0.7%
25
International Journal of Medical Informatics
25 papers in training set
Top 2%
0.5%