Robust Reinforcement Learning: A Constrained Game-theoretic Approach
Abstract
Deep reinforcement learning (RL) methods provide state-of-art performance in complex control tasks. However, it has been widely recognized that RL methods often fail to generalize due to unaccounted uncertainties. In this work, we propose a game theoretic framework for robust reinforcement learning that comprises many previous works as special cases. We formulate robust RL as a constrained minimax game between the RL agent and an environmental agent which represents uncertainties such as model parameter variations and adversarial disturbances. To solve the competitive optimization problems arising in our framework, we propose to use competitive mirror descent (CMD). This method accounts for the interactive nature of the game at each iteration while using Bregman divergences to adapt to the global structure of the constraint set. We demonstrate an RRL policy gradient algorithm that leverages Lagrangian duality and CMD. We empirically show that our algorithm is stable for large step sizes, resulting in faster convergence on linear quadratic games.
Additional Information
© 2021 J. Yu, C. Gehring, F. Schäfer & A. Anandkumar. We thank the anonymous referees for their valuable feedback. CG gratefully acknowledges support from NSF grant 1723381; from AFOSR grant FA9550-17-1-0165; from ONR grant N00014-18-1-2847 and from the MIT-IBM Watson Lab. FS gratefully acknowledges support by the Air Force Office of Scientific Research under award number FA9550-18-1-0271 (Games for Computation and Learning) and the Ronald and Maxine Linde Institute of Economic and Management Sciences at Caltech. AA is supported in part by the Bren endowed chair, Microsoft, Google, Facebook and Adobe faculty fellowships.Attached Files
Published - yu21a.pdf
Files
Name | Size | Download all |
---|---|---|
md5:cb0cd01be0f51a82e30e18c634d2a590
|
362.6 kB | Preview Download |
Additional details
- Eprint ID
- 110027
- Resolver ID
- CaltechAUTHORS:20210727-172214672
- NSF
- IIS-1723381
- Air Force Office of Scientific Research (AFOSR)
- FA9550-17-1-0165
- Office of Naval Research (ONR)
- N00014-18-1-2847
- Massachusetts Institute of Technology (MIT)
- Air Force Office of Scientific Research (AFOSR)
- FA9550-18-1-0271
- Linde Institute of Economic and Management Science
- Bren Professor of Computing and Mathematical Sciences
- Microsoft
- Adobe
- Created
-
2021-07-28Created from EPrint's datestamp field
- Updated
-
2021-07-28Created from EPrint's last_modified field