Online Optimization with Feedback Delay and Nonlinear Switching Cost
- Creators
- Pan, Weici
-
Shi, Guanya
-
Lin, Yiheng
-
Wierman, Adam
Abstract
We study a variant of online optimization in which the learner receives k-round delayed feedback about hitting cost and there is a multi-step nonlinear switching cost, i.e., costs depend on multiple previous actions in a nonlinear manner. Our main result shows that a novel Iterative Regularized Online Balanced Descent (iROBD) algorithm has a constant, dimension-free competitive ratio that is O(L^(2k)), where L is the Lipschitz constant of the switching cost. Additionally, we provide lower bounds that illustrate the Lipschitz condition is required and the dependencies on k and L are tight. Finally, via reductions, we show that this setting is closely related to online control problems with delay, nonlinear dynamics, and adversarial disturbances, where iROBD directly offers constant-competitive online policies.
Additional Information
© 2022 Owner/Author. Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the owner/author(s).Attached Files
Published - 3508037.pdf
Submitted - 2111.00095.pdf
Files
Name | Size | Download all |
---|---|---|
md5:69d289920c36ff71a058bada9fdce28f
|
821.8 kB | Preview Download |
md5:fc53dbb975878ff7eb2cd05455eeb5e0
|
810.2 kB | Preview Download |
Additional details
- Eprint ID
- 113677
- Resolver ID
- CaltechAUTHORS:20220302-699021182
- Created
-
2022-03-02Created from EPrint's datestamp field
- Updated
-
2022-12-23Created from EPrint's last_modified field