Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published March 2022 | Submitted + Published
Journal Article Open

Online Optimization with Feedback Delay and Nonlinear Switching Cost

Abstract

We study a variant of online optimization in which the learner receives k-round delayed feedback about hitting cost and there is a multi-step nonlinear switching cost, i.e., costs depend on multiple previous actions in a nonlinear manner. Our main result shows that a novel Iterative Regularized Online Balanced Descent (iROBD) algorithm has a constant, dimension-free competitive ratio that is O(L^(2k)), where L is the Lipschitz constant of the switching cost. Additionally, we provide lower bounds that illustrate the Lipschitz condition is required and the dependencies on k and L are tight. Finally, via reductions, we show that this setting is closely related to online control problems with delay, nonlinear dynamics, and adversarial disturbances, where iROBD directly offers constant-competitive online policies.

Additional Information

© 2022 Owner/Author. Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the owner/author(s).

Attached Files

Published - 3508037.pdf

Submitted - 2111.00095.pdf

Files

2111.00095.pdf
Files (1.6 MB)
Name Size Download all
md5:69d289920c36ff71a058bada9fdce28f
821.8 kB Preview Download
md5:fc53dbb975878ff7eb2cd05455eeb5e0
810.2 kB Preview Download

Additional details

Created:
August 22, 2023
Modified:
October 23, 2023