Back to Search
Start Over
When Privacy Meets Partial Information: A Refined Analysis of Differentially Private Bandits
- Source :
- Advances in Neural Information Processing Systems, Advances in Neural Information Processing Systems, Dec 2022, New Orleans, United States
- Publication Year :
- 2022
- Publisher :
- arXiv, 2022.
-
Abstract
- We study the problem of multi-armed bandits with $\epsilon$-global Differential Privacy (DP). First, we prove the minimax and problem-dependent regret lower bounds for stochastic and linear bandits that quantify the hardness of bandits with $\epsilon$-global DP. These bounds suggest the existence of two hardness regimes depending on the privacy budget $\epsilon$. In the high-privacy regime (small $\epsilon$), the hardness depends on a coupled effect of privacy and partial information about the reward distributions. In the low-privacy regime (large $\epsilon$), bandits with $\epsilon$-global DP are not harder than the bandits without privacy. For stochastic bandits, we further propose a generic framework to design a near-optimal $\epsilon$ global DP extension of an index-based optimistic bandit algorithm. The framework consists of three ingredients: the Laplace mechanism, arm-dependent adaptive episodes, and usage of only the rewards collected in the last episode for computing private statistics. Specifically, we instantiate $\epsilon$-global DP extensions of UCB and KL-UCB algorithms, namely AdaP-UCB and AdaP-KLUCB. AdaP-KLUCB is the first algorithm that both satisfies $\epsilon$-global DP and yields a regret upper bound that matches the problem-dependent lower bound up to multiplicative constants.<br />Comment: Appears in NeurIPS 2022. From v1, the minimax lower bound for linear bandits is changed to $O(\max(d \sqrt{T}, d/\epsilon))$
- Subjects :
- FOS: Computer and information sciences
Computer Science - Machine Learning
Computer Science - Cryptography and Security
Mathematics - Statistics Theory
Machine Learning (stat.ML)
Statistics Theory (math.ST)
Linear bandits
[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]
Machine Learning (cs.LG)
Differential privacy
[INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG]
[STAT.ML]Statistics [stat]/Machine Learning [stat.ML]
[INFO.INFO-CY]Computer Science [cs]/Computers and Society [cs.CY]
Statistics - Machine Learning
[MATH.MATH-ST]Mathematics [math]/Statistics [math.ST]
[INFO.INFO-IT]Computer Science [cs]/Information Theory [cs.IT]
FOS: Mathematics
Multiarmed bandit
Regret Bounds
UCB policies
Cryptography and Security (cs.CR)
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- Advances in Neural Information Processing Systems, Advances in Neural Information Processing Systems, Dec 2022, New Orleans, United States
- Accession number :
- edsair.doi.dedup.....ad7b5c43efbfb8e91a03d474425f1f8d
- Full Text :
- https://doi.org/10.48550/arxiv.2209.02570