Real-world EHR-derived progression-free survival across successive lines of therapy informs metastatic breast cancer risk stratification
Zhao, X.; Niederhauser, T.; Balazs, Z.; Wicki, A.; Fan, B.; Krauthammer, M.
Show abstract
Guideline-based recommendations for metastatic lines of therapy (mLoTs), especially second lines and beyond, are comparatively sparse due to challenges in later-line treatment efficacy quantification. Scalable real-world evidence that captures the interaction between treatment and disease progression is therefore especially valuable, as regimens become increasingly individualized, confounding intensifies, and progression is rarely recorded as a structured EHR endpoint. We present a framework to (i) reconstruct clinically coherent mLoTs from longitudinal EHR using radiology-anchored progression evidence and (ii) generate individualized progression-free survival (PFS) estimates from a line-start multimodal snapshot in a highly heterogeneous cohort. In 2,881 patients contributing 8,791 metastatic mLoTs, the selected model shows strong discrimination over a 2-year horizon (Antolinis C = 0.680 {+/-} 0.006; cumulative/dynamic AUC at 1 year = 0.824 {+/-} 0.006). Predicted risk strata closely track Kaplan-Meier trends across line number and tumor subtypes, enabling calibrated risk stratification even in smaller sub-cohorts. Model prediction primarily relies on clinically plausible signals of recent metastatic burden and tumor markers, with limited dependence on surveillance cadence or subtype labels, and is robust to missingness. Together, this framework supports scalable evidence generation and interpretable, calibrated prognostication to inform risk assessment and care planning in heterogeneous metastatic practice.
Matching journals
The top 7 journals account for 50% of the predicted probability mass.