[edit]
Open Problem: Optimal Rates for Stochastic Decision-Theoretic Online Learning Under Differentially Privacy
Proceedings of Thirty Seventh Conference on Learning Theory, PMLR 247:5330-5334, 2024.
Abstract
For the stochastic variant of decision-theoretic online learning with K actions, T rounds, and minimum gap Δmin, the optimal, gap-dependent rate of the pseudo-regret is known to be O \left( \frac{\log K}{\Delta_{\min}} \right). We ask to settle the optimal gap-dependent rate for the problem under \varepsilon-differential privacy.