SurvTEDVAE: A Disentangled Variational Autoencoder for Heterogeneous Treatment Effect Estimation with Time-to-Event Outcomes
Powell, W. J. B.; Zhang, L.
Show abstract
Estimating heterogeneous treatment effects (HTE) from observational health data is essential for precision medicine, yet existing methods often struggle with high-dimensional covariates and time-to-event outcomes common in electronic health records (EHRs). We propose SurvTEDVAE, a disentangled variational autoencoder designed for causal survival analysis. The model learns latent representations corresponding to instrumental factors, confounders, and outcome-dependent risk factors, and integrates a survival likelihood to model time-to-event outcomes with censoring. The learned representations are used to estimate conditional average treatment effects using downstream causal estimators. We evaluated SurvTEDVAE using a semi-synthetic ACTG dataset and a high-dimensional EHR-based hypertension cohort with over 20,000 covariates. Across both datasets, SurvTEDVAE achieved lower estimation error for heterogeneous treatment effects compared with meta-learning and causal survival forest approaches. These results demonstrate that disentangled representation learning can improve causal effect estimation for survival outcomes in high-dimensional real-world health data.
Matching journals
The top 6 journals account for 50% of the predicted probability mass.