Back

PrivateBoost: Privacy-Preserving Federated Gradient Boosting for Cross-Device Medical Data

Specht, B.; Garbaya, S.; Ermis, O.; Schneider, R.; Chavarriaga, R.; Khadraoui, D.; Tayeb, Z.

2026-03-10 health informatics
10.64898/2026.02.10.26345891 medRxiv
Show abstract

Cross-device medical federated learning where individual patients participate directly rather than institutions poses a unique challenge: each client holds only a few samples, often just one (e.g., a single diagnostic record), leaving insufficient local data for gradient computation. Existing approaches, such as Secure Aggregation, require client-to-client coordination impractical for intermittently available mobile devices, while homomorphic encryption-based alternatives introduce sophisticated key management and coordination requirements ill-suited to dynamic cross-device deployments. We present privateboost, a federated XGBoost system that addresses this setting through m-of-n Shamir secret sharing with commitment-based anonymous aggregation. Clients distribute shares to a fixed set of shareholders requiring no client-to-client communication and the aggregator reconstructs only aggregate gradient sums via Lagrange interpolation, never observing individual values or client identities. We evaluate on UCI medical datasets, demonstrating 98% split gain retention relative to centralized XGBoost and accuracy resilient to up to 80% client dropout.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 2%
26.7%
2
Nature Medicine
117 papers in training set
Top 0.2%
7.0%
3
Nature Biomedical Engineering
42 papers in training set
Top 0.1%
6.6%
4
Nature Computational Science
50 papers in training set
Top 0.1%
6.5%
5
Science Advances
1098 papers in training set
Top 3%
4.5%
50% of probability mass above
6
Scientific Reports
3102 papers in training set
Top 26%
4.4%
7
npj Digital Medicine
97 papers in training set
Top 1%
4.3%
8
Science Translational Medicine
111 papers in training set
Top 1%
3.2%
9
Bioinformatics
1061 papers in training set
Top 6%
2.4%
10
Cell Systems
167 papers in training set
Top 6%
2.1%
11
PLOS ONE
4510 papers in training set
Top 47%
2.1%
12
Nature Machine Intelligence
61 papers in training set
Top 2%
1.8%
13
PLOS Digital Health
91 papers in training set
Top 1%
1.7%
14
Nature Methods
336 papers in training set
Top 4%
1.7%
15
Patterns
70 papers in training set
Top 1%
1.5%
16
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 40%
1.0%
17
Communications Biology
886 papers in training set
Top 18%
0.9%
18
Nature Biotechnology
147 papers in training set
Top 7%
0.9%
19
Journal of Medical Internet Research
85 papers in training set
Top 4%
0.8%
20
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.8%
0.8%
21
Journal of the American Medical Informatics Association
61 papers in training set
Top 2%
0.8%
22
Communications Medicine
85 papers in training set
Top 1.0%
0.8%
23
PNAS Nexus
147 papers in training set
Top 1%
0.8%
24
Neuron
282 papers in training set
Top 8%
0.7%
25
Genome Research
409 papers in training set
Top 4%
0.7%
26
iScience
1063 papers in training set
Top 32%
0.7%
27
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 2%
0.7%
28
JMIR Medical Informatics
17 papers in training set
Top 2%
0.7%
29
Biology Methods and Protocols
53 papers in training set
Top 3%
0.7%
30
Nature
575 papers in training set
Top 17%
0.7%