학술논문

${\sf FederBoost}$: Private Federated Learning for GBDT
Document Type
Periodical
Source
IEEE Transactions on Dependable and Secure Computing IEEE Trans. Dependable and Secure Comput. Dependable and Secure Computing, IEEE Transactions on. 21(3):1274-1285 Jun, 2024
Subject
Computing and Processing
Decision trees
Training
Federated learning
Data models
Prediction algorithms
Cryptography
Boosting
federated learning
GBDT
privacy
Language
ISSN
1545-5971
1941-0018
2160-9209
Abstract
Federated Learning (FL) has been an emerging trend in machine learning and artificial intelligence. It allows multiple participants to collaboratively train a better global model and offers a privacy-aware paradigm for model training since it does not require participants to release their original training data. However, existing FL solutions for vertically partitioned data or decision trees require heavy cryptographic operations. In this article, we propose a framework named $\mathsf {FederBoost}$FederBoost for private federated learning of gradient boosting decision trees (GBDT). It supports running GBDT over both vertically and horizontally partitioned data. Vertical $\mathsf {FederBoost}$FederBoost does not require any cryptographic operation and horizontal $\mathsf {FederBoost}$FederBoost only requires lightweight secure aggregation. The key observation is that the whole training process of GBDT relies on the ordering of the data instead of the values. We fully implement $\mathsf {FederBoost}$FederBoost and evaluate its utility and efficiency through extensive experiments performed on three public datasets. Our experimental results show that both vertical and horizontal $\mathsf {FederBoost}$FederBoost achieve the same level of accuracy with centralized training where all data are collected in a central server; and they are 4-5 orders of magnitude faster than the state-of-the-art solutions for federated decision tree training; hence offering practical solutions for industrial applications.