Back

An End-to-End Synthetic Oncology Clinical Trial Framework Integrating Radiographic Response, Circulating Tumor DNA, Safety, and Survival for Decision-Oriented Clinical Data Science

Petalcorin, M. I. R.

2026-04-08 health informatics
10.64898/2026.04.07.26350297 medRxiv
Show abstract

Background: Modern oncology development depends on integrating radiographic response, molecular biomarkers, treatment exposure, safety, and survival endpoints, yet access to well-structured patient-level trial data is often limited. Methods: We developed a synthetic, literature-informed phase II randomized oncology trial framework that followed the sequence Patient [->] Data [->] Dataset [->] Analysis [->] Tables/Figures [->] Decision. A cohort of randomized patients was simulated with baseline demographic and disease features, longitudinal tumor measurements, circulating tumor DNA, inflammatory and exploratory biomarkers, adverse events, treatment exposure, and survival outcomes. Raw source datasets were transformed into SDTM-like domains and ADaM-like analysis datasets, then analyzed for baseline characteristics, exposure, best overall response, survival, subgroup hazard ratios, longitudinal tumor and biomarker changes, exposure-response, and safety. Results: The treatment arm showed a coherent efficacy signal across multiple analytical layers. Treatment increased objective response and clinical benefit, reduced tumor burden over time, and prolonged survival. Median overall survival increased from 135 days in the control arm to 288 days in the treatment arm, with an approximate hazard ratio of 0.661 (95% CI, 0.480-0.911; p = 0.011). Median progression-free survival increased from 116 to 208 days, with an approximate hazard ratio of 0.601 (95% CI, 0.418-0.864; p = 0.006). Circulating tumor DNA showed a more favorable trajectory in treated patients and aligned directionally with radiographic and survival benefit. Safety analyses showed increased treatment-related toxicity, but the overall safety profile remained interpretable and compatible with continued development. Conclusions: This study demonstrates that a synthetic, literature-informed oncology trial can reproduce a biologically plausible and analytically coherent efficacy-safety signal architecture across radiographic, molecular, and time-to-event endpoints, providing a decision-oriented prototype for translational oncology clinical data science. Keywords: synthetic clinical trial, oncology, ctDNA, Kaplan-Meier, biomarker, survival analysis, translational data science, ADaM, SDTM

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.1%
28.2%
2
npj Digital Medicine
97 papers in training set
Top 0.7%
6.5%
3
Scientific Reports
3102 papers in training set
Top 22%
4.9%
4
Nature Communications
4913 papers in training set
Top 34%
4.4%
5
Bioinformatics
1061 papers in training set
Top 5%
4.0%
6
Cancer Medicine
24 papers in training set
Top 0.3%
3.7%
50% of probability mass above
7
PLOS Computational Biology
1633 papers in training set
Top 11%
3.1%
8
BMC Medical Informatics and Decision Making
39 papers in training set
Top 1%
2.1%
9
PLOS ONE
4510 papers in training set
Top 47%
2.1%
10
Computers in Biology and Medicine
120 papers in training set
Top 2%
1.9%
11
Journal of Medical Internet Research
85 papers in training set
Top 3%
1.7%
12
Clinical and Translational Science
21 papers in training set
Top 0.4%
1.7%
13
BMC Bioinformatics
383 papers in training set
Top 5%
1.5%
14
Artificial Intelligence in Medicine
15 papers in training set
Top 0.4%
1.5%
15
BMC Medical Research Methodology
43 papers in training set
Top 0.7%
1.5%
16
JCO Precision Oncology
14 papers in training set
Top 0.2%
1.4%
17
JMIR Medical Informatics
17 papers in training set
Top 0.9%
1.4%
18
Informatics in Medicine Unlocked
21 papers in training set
Top 0.7%
1.2%
19
Computer Methods and Programs in Biomedicine
27 papers in training set
Top 0.5%
1.2%
20
Journal of Personalized Medicine
28 papers in training set
Top 0.7%
1.1%
21
Annals of Internal Medicine
27 papers in training set
Top 0.7%
0.9%
22
Bulletin of Mathematical Biology
84 papers in training set
Top 2%
0.9%
23
Biomedicines
66 papers in training set
Top 2%
0.8%
24
European Respiratory Journal
54 papers in training set
Top 2%
0.8%
25
eBioMedicine
130 papers in training set
Top 4%
0.8%
26
Journal of the American Medical Informatics Association
61 papers in training set
Top 2%
0.8%
27
Cell Reports Medicine
140 papers in training set
Top 8%
0.8%
28
BMJ Health & Care Informatics
13 papers in training set
Top 0.9%
0.7%
29
Frontiers in Artificial Intelligence
18 papers in training set
Top 0.8%
0.7%
30
European Journal of Cancer
10 papers in training set
Top 0.6%
0.7%