Federated penalized piecewise exponential model for horizontally distributed survival data: FedPPEM

Islam, N.; Luo, C.; Tong, J.; Polleya, D. A.; Jordan, C. T.; Haverkos, B.; Bair, S.; Kent, A.; Weller, G.

2026-02-12 health informatics
10.64898/2026.02.11.26346054 medRxiv
Cox proportional hazards regressions are frequently employed to develop prognostic models for time-to-event data, considering both patient-specific and disease-specific characteristics. In high-dimensional clinical modeling, these biological features can exhibit high collinearity due to inter-feature relationships, which, without regularization, can cause instability and numerical issues during estimation. For rare diseases such as acute myeloid leukemia (AML), the sparsity and scarcity of data further complicate estimation. In such cases, data augmentation through multi-site collaboration can alleviate these problems. However, this often necessitates sharing individual patient data (IPD) across sites, which presents challenges due to regulatory barriers aimed at protecting patient privacy. To overcome these challenges, we propose a privacy-preserving algorithm that eliminates the need to share IPD across sites and fits a federated penalized piecewise exponential model (FedPPEM) to estimate potential effects of clinical features using summary statistics. This algorithm yields results nearly identical to those from pooled IPD, including effect size and standard error estimates. We demonstrate the model's performance in quantifying effects of clinical features and genetic risk classification on overall survival using real-world data from approximately 1,200 newly diagnosed AML patients across 33 U.S. sites. Although applied here in the AML context, this model is disease-agnostic and can be implemented in other diseases and clinical contexts.
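
As an illustration of how a model of this kind can be fitted from summary statistics alone, the following is a minimal sketch and not the authors' implementation. It relies on the standard equivalence between the piecewise exponential likelihood and a Poisson likelihood with a log-exposure offset. Several things here are assumptions made for illustration: the ridge (L2) penalty, the Newton-Raphson updates, the choice of gradients and Hessians as the shared summaries, and all function names (`expand_to_intervals`, `site_summary`, `fit_federated`). The abstract does not specify the penalty or the communication scheme that FedPPEM uses.

```python
import numpy as np

def expand_to_intervals(time, event, X, cuts):
    """Split follow-up into piecewise-constant hazard intervals.

    Returns Z (interval dummies followed by covariates), the event
    indicator d per row, and the exposure time e per row.
    """
    edges = np.concatenate(([0.0], cuts, [np.inf]))
    K = len(edges) - 1
    rows, d, e = [], [], []
    for t, delta, x in zip(time, event, X):
        for k in range(K):
            lo, hi = edges[k], edges[k + 1]
            if t <= lo:
                break  # subject left the risk set before this interval
            onehot = np.zeros(K)
            onehot[k] = 1.0
            rows.append(np.concatenate((onehot, x)))
            d.append(float(delta == 1 and t <= hi))
            e.append(min(t, hi) - lo)
    return np.array(rows), np.array(d), np.array(e)

def site_summary(Z, d, e, theta):
    """Computed locally at a site: only these aggregates leave the site."""
    mu = e * np.exp(Z @ theta)             # expected events per row
    grad = Z.T @ (d - mu)                  # score of the Poisson log-likelihood
    hess = Z.T @ (Z * mu[:, None])         # negative Hessian (information)
    return grad, hess

def fit_federated(sites, n_base, lam=1.0, iters=50, tol=1e-10):
    """Coordinator: sum site summaries, apply the assumed ridge penalty to the
    covariate effects only (not the baseline log-hazards), take Newton steps."""
    p = sites[0][0].shape[1]
    theta = np.zeros(p)
    pen = np.zeros(p)
    pen[n_base:] = lam
    for _ in range(iters):
        parts = [site_summary(Z, d, e, theta) for Z, d, e in sites]
        grad = sum(g for g, _ in parts) - pen * theta
        hess = sum(H for _, H in parts) + np.diag(pen)
        step = np.linalg.solve(hess, grad)
        theta = theta + step
        if np.max(np.abs(step)) < tol:
            break
    return theta

# Simulated data (purely illustrative, not from the paper).
rng = np.random.default_rng(0)
n, beta, cuts = 600, np.array([0.5, -0.3]), np.array([0.5, 1.5])
X = rng.normal(size=(n, 2))
T = rng.exponential(1.0 / np.exp(X @ beta))   # event times, baseline hazard 1
C = rng.exponential(1.0 / 0.3, size=n)        # independent censoring times
time, event = np.minimum(T, C), (T <= C).astype(int)

# Three hypothetical sites, each holding a disjoint subset of patients.
site_idx = np.array_split(np.arange(n), 3)
sites = [expand_to_intervals(time[i], event[i], X[i], cuts) for i in site_idx]
pooled = [expand_to_intervals(time, event, X, cuts)]

theta_fed = fit_federated(sites, n_base=len(cuts) + 1)
theta_pooled = fit_federated(pooled, n_base=len(cuts) + 1)
print(np.max(np.abs(theta_fed - theta_pooled)))
```

In this sketch the federated and pooled fits agree up to floating-point error, because the penalized log-likelihood is a sum over patients and its gradient and Hessian therefore decompose into per-site sums. Under the same assumptions, the inverse of the final penalized information matrix could serve as a basis for standard errors.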

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

1. IEEE Journal of Biomedical and Health Informatics | 34 papers in training set | Top 0.1% | 9.1%
2. Bioinformatics | 1061 papers in training set | Top 3% | 8.4%
3. PLOS ONE | 4510 papers in training set | Top 28% | 6.3%
4. Journal of Biomedical Informatics | 45 papers in training set | Top 0.2% | 6.3%
5. Scientific Reports | 3102 papers in training set | Top 19% | 6.3%
6. Patterns | 70 papers in training set | Top 0.1% | 4.8%
7. Nature Computational Science | 50 papers in training set | Top 0.1% | 4.3%
8. Journal of the American Medical Informatics Association | 61 papers in training set | Top 0.7% | 3.9%
9. Nature Communications | 4913 papers in training set | Top 40% | 3.6%

50% of probability mass above

10. Communications Biology | 886 papers in training set | Top 3% | 2.7%
11. npj Digital Medicine | 97 papers in training set | Top 2% | 2.3%
12. BMC Bioinformatics | 383 papers in training set | Top 4% | 2.1%
13. Journal of Medical Internet Research | 85 papers in training set | Top 2% | 2.1%
14. JCO Clinical Cancer Informatics | 18 papers in training set | Top 0.4% | 2.1%
15. Science Advances | 1098 papers in training set | Top 15% | 1.9%
16. iScience | 1063 papers in training set | Top 12% | 1.9%
17. IEEE Transactions on Computational Biology and Bioinformatics | 17 papers in training set | Top 0.2% | 1.9%
18. Briefings in Bioinformatics | 326 papers in training set | Top 4% | 1.8%
19. BMC Medical Informatics and Decision Making | 39 papers in training set | Top 1% | 1.7%
20. PLOS Computational Biology | 1633 papers in training set | Top 20% | 1.2%
21. Medical Image Analysis | 33 papers in training set | Top 0.8% | 1.2%
22. JMIR Medical Informatics | 17 papers in training set | Top 1% | 1.2%
23. Advanced Science | 249 papers in training set | Top 16% | 0.9%
24. JAMIA Open | 37 papers in training set | Top 1% | 0.8%
25. Journal of Personalized Medicine | 28 papers in training set | Top 1% | 0.8%
26. Nature Machine Intelligence | 61 papers in training set | Top 3% | 0.8%
27. Physical Biology | 43 papers in training set | Top 2% | 0.8%
28. Bulletin of Mathematical Biology | 84 papers in training set | Top 2% | 0.7%
29. Expert Systems with Applications | 11 papers in training set | Top 0.5% | 0.7%
30. PNAS Nexus | 147 papers in training set | Top 2% | 0.7%
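
The 50% cutoff stated above can be checked from the listed probabilities. The short sketch below accumulates the rounded percentages for ranks 1 to 10 as they appear in the list and finds the first rank at which the running total reaches 50%. The variable names are illustrative, and the underlying unrounded probabilities are not available here.

```python
from itertools import accumulate

# Predicted probabilities (percent) for ranks 1-10, as listed (rounded values).
probs = [9.1, 8.4, 6.3, 6.3, 6.3, 4.8, 4.3, 3.9, 3.6, 2.7]

cumulative = list(accumulate(probs))

# First rank whose running total reaches at least 50%.
cutoff_rank = next(rank for rank, total in enumerate(cumulative, start=1)
                   if total >= 50.0)
print(cutoff_rank, round(cumulative[cutoff_rank - 1], 1))
```

With the rounded values, the running total is 49.4% after rank 8 and 53.0% after rank 9, consistent with the statement that the top 9 journals account for 50% of the predicted probability mass.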