Online Optimization with Feedback Delay and Nonlinear Switching Cost
- Creators
- Pan, Weici
-
Shi, Guanya
-
Lin, Yiheng
-
Wierman, Adam
Abstract
We study a variant of online optimization in which the learner receives k-round delayed feedback about hitting cost and there is a multi-step nonlinear switching cost, i.e., costs depend on multiple previous actions in a nonlinear manner. Our main result shows that a novel Iterative Regularized Online Balanced Descent (iROBD) algorithm has a constant, dimension-free competitive ratio that is O(L^(2k)), where L is the Lipschitz constant of the nonlinear switching cost. Additionally, we provide lower bounds that illustrate the Lipschitz condition is required and the dependencies on k and L are tight. Finally, via reductions, we show that this setting is closely related to online control problems with delay, nonlinear dynamics, and adversarial disturbances, where iROBD directly offers constant-competitive online policies. This extended abstract is an abridged version of [2].
Additional Information
© 2022 Copyright held by the owner/author(s).Attached Files
Published - 3489048.3522657.pdf
Submitted - 2111.00095.pdf
Files
Name | Size | Download all |
---|---|---|
md5:3c3b5f61251b8563443515ace919de7e
|
867.6 kB | Preview Download |
md5:69d289920c36ff71a058bada9fdce28f
|
821.8 kB | Preview Download |
Additional details
- Eprint ID
- 113733
- Resolver ID
- CaltechAUTHORS:20220304-172341428
- Created
-
2022-03-07Created from EPrint's datestamp field
- Updated
-
2022-06-28Created from EPrint's last_modified field