Start Over

Distributed Bandit Online Convex Optimization With Time-Varying Coupled Inequality Constraints.

Authors :: Yi, Xinlei
Li, Xiuxian
Yang, Tao
Xie, Lihua
Chai, Tianyou
Johansson, Karl Henrik
Source :: IEEE Transactions on Automatic Control. Oct2021, Vol. 66 Issue 10, p4620-4635. 16p.
Publication Year :: 2021
Abstract: Distributed bandit online convex optimization with time-varying coupled inequality constraints is considered, motivated by a repeated game between a group of learners and an adversary. The learners attempt to minimize a sequence of global loss functions and at the same time satisfy a sequence of coupled constraint functions, where the constraints are coupled across the distributed learners at each round. The global loss and the coupled constraint functions are the sum of local convex loss and constraint functions, respectively, which are adaptively generated by the adversary. The local loss and constraint functions are revealed in a bandit manner, i.e., only the values of loss and constraint functions are revealed to the learners at the sampling instance, and the revealed function values are held privately by each learner. Both one- and two-point bandit feedback are studied with the two corresponding distributed bandit online algorithms used by the learners. We show that sublinear expected regret and constraint violation are achieved by these two algorithms, if the accumulated variation of the comparator sequence also grows sublinearly. In particular, we show that $\mathcal {O}(T^{\theta })$ expected static regret and $\mathcal {O}(T^{7/4-\theta })$ constraint violation are achieved in the one-point bandit feedback setting, and $\mathcal {O}(T^{\max \lbrace \kappa,1-\kappa \rbrace })$ expected static regret and $\mathcal {O}(T^{1-\kappa /2})$ constraint violation in the two-point bandit feedback setting, where $\theta \in (3/4,5/6]$ and $\kappa \in (0,1)$ are user-defined tradeoff parameters. Finally, the tightness of the theoretical results is illustrated by numerical simulations of a simple power grid example, which also compares the proposed algorithms to algorithms existing in the literature. [ABSTRACT FROM AUTHOR]

Subjects :: *ROBBERS
*ONLINE algorithms
*ELECTRIC power distribution grids
*DISTRIBUTED algorithms
*APPROXIMATION algorithms
*HEURISTIC algorithms

Details

Language :: English
ISSN :: 00189286
Volume :: 66
Issue :: 10
Database :: Academic Search Index
Journal :: IEEE Transactions on Automatic Control
Publication Type :: Periodical
Accession number :: 153732244
Full Text :: https://doi.org/10.1109/TAC.2020.3030883

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Distributed Bandit Online Convex Optimization With Time-Varying Coupled Inequality Constraints.

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Distributed Bandit Online Convex Optimization With Time-Varying Coupled Inequality Constraints.

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources