Partial model averaging in Federated Learning: Performance guarantees and benefits

Citations: Web of Science 9, Scopus 14

Abstract

Local Stochastic Gradient Descent (SGD) with periodic model averaging (FedAvg) is a foundational algorithm in Federated Learning. The algorithm runs SGD independently on multiple clients and periodically averages the model across all clients. This periodic model averaging can cause a significant model discrepancy across the clients, which makes the global loss converge slowly. Although recent advanced optimization methods address this issue with a focus on non-IID settings, the model discrepancy persists because of the underlying periodic model averaging. We propose a partial model averaging framework that mitigates the model discrepancy issue in Federated Learning. Partial averaging encourages the local models to stay close to each other in parameter space, which allows the global loss to be minimized more effectively. We extensively evaluate the performance of the partial averaging strategy on the CIFAR-10/100 and FEMNIST benchmarks. Given a fixed number of training iterations and a large number of clients (128), partial averaging achieves up to 2.2% higher accuracy than periodic full averaging.
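
The contrast between periodic full averaging and partial averaging can be illustrated with a toy sketch. The abstract does not spell out the partial averaging schedule, so the version below rests on an assumption: the parameter vector is split into `tau` blocks and one block is averaged after every local step, whereas full averaging synchronizes all parameters every `tau` steps. The quadratic client objectives, the function names, and all hyperparameters are hypothetical and chosen only for illustration; this is not the authors' implementation.

```python
import numpy as np

def full_average(models):
    """Replace every client's model with the mean over all clients."""
    mean = models.mean(axis=0)
    return np.tile(mean, (models.shape[0], 1))

def partial_average(models, block):
    """Average only the coordinates in `block`; leave the others untouched."""
    out = models.copy()
    out[:, block] = models[:, block].mean(axis=0)
    return out

def discrepancy(models):
    """Mean squared distance of the client models from their average."""
    return float(((models - models.mean(axis=0)) ** 2).sum(axis=1).mean())

def run(num_clients=8, dim=12, tau=4, steps=40, lr=0.1, partial=False, seed=0):
    """Local SGD on toy objectives 0.5 * ||w - target_i||^2 (assumed setup)."""
    rng = np.random.default_rng(seed)
    # Each client has its own target, a crude stand-in for non-IID data.
    targets = rng.normal(size=(num_clients, dim))
    models = np.zeros((num_clients, dim))
    # Assumed schedule: tau disjoint blocks, visited in round-robin order.
    blocks = np.array_split(np.arange(dim), tau)
    history = []
    for t in range(steps):
        noise = 0.1 * rng.normal(size=models.shape)  # stochastic gradient noise
        grads = (models - targets) + noise
        models = models - lr * grads
        if partial:
            # Synchronize one block after every local step.
            models = partial_average(models, blocks[t % tau])
        elif (t + 1) % tau == 0:
            # Synchronize the whole model every tau steps.
            models = full_average(models)
        history.append(discrepancy(models))
    return models, history

if __name__ == "__main__":
    _, full_hist = run(partial=False)
    _, part_hist = run(partial=True)
    print("mean discrepancy, full averaging:   ", np.mean(full_hist))
    print("mean discrepancy, partial averaging:", np.mean(part_hist))
```

In this sketch both schedules communicate the same number of parameters over any window of `tau` steps; they differ only in when each coordinate is synchronized, which is what the `discrepancy` trace is meant to expose.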

Keywords

Federated Learning; Partial model aggregation; Model discrepancy
Title
Partial model averaging in Federated Learning: Performance guarantees and benefits
Authors
Lee, Sunwoo; Sahu, Anit Kumar; He, Chaoyang; Avestimehr, Salman
DOI
10.1016/j.neucom.2023.126647
Publication date
2023-11-01
Type
Article
Journal
Neurocomputing
Volume 556