Arabzadeh, A. and Grant, J.A. and Leslie, D.S. (2024) Federated χ-armed Bandit with Flexible Personalisation. Transactions on Machine Learning Research, 2024. (In Press)
Federated_X_armed_bandit_with_flexible_personalization_TMLR_revision_1_.pdf - Accepted Version
Available under License Creative Commons Attribution.
Download (685kB)
Abstract
This paper introduces a novel approach to personalised federated learning within the X -armed bandit framework, addressing the challenge of optimising both local and global objectives in a highly heterogeneous environment. Our method employs a surrogate objective function that combines individual client preferences with aggregated global knowledge, allowing for a flexible trade-off between personalisation and collective learning. We propose a phasebased elimination algorithm that achieves sublinear regret with logarithmic communication overhead, making it well-suited for federated settings. Theoretical analysis and empirical evaluations demonstrate the effectiveness of our approach compared to existing methods. Potential applications of this work span various domains, including healthcare, smart home devices, and e-commerce, where balancing personalisation with global insights is crucial.
![[thumbnail of Federated_X_armed_bandit_with_flexible_personalization___TMLR__revision_1_]](https://eprints.lancs.ac.uk/style/images/fileicons/text.png)
 Altmetric
 Altmetric Altmetric
 Altmetric