Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published July 2021 | Supplemental Material + Accepted Version + Published
Journal Article Open

Density Constrained Reinforcement Learning

Abstract

We study constrained reinforcement learning (CRL) from a novel perspective by setting constraints directly on state density functions, rather than the value functions considered by previous works. State density has a clear physical and mathematical interpretation, and is able to express a wide variety of constraints such as resource limits and safety requirements. Density constraints can also avoid the time-consuming process of designing and tuning cost functions required by value function-based constraints to encode system specifications. We leverage the duality between density functions and Q functions to develop an effective algorithm to solve the density constrained RL problem optimally and the constrains are guaranteed to be satisfied. We prove that the proposed algorithm converges to a near-optimal solution with a bounded error even when the policy update is imperfect. We use a set of comprehensive experiments to demonstrate the advantages of our approach over state-of-the-art CRL methods, with a wide range of density constrained tasks as well as standard CRL benchmarks such as Safety-Gym.

Additional Information

© 2021 by the author(s). The authors acknowledge support from the DARPA Assured Autonomy under contract FA8750-19-C-0089 and from the Defense Science and Technology Agency in Singapore. The views, opinions, and/or findings expressed are those of the authors and should not be interpreted as representing the official views or policies of the Department of Defense, the U.S. Government, DSTA Singapore, or the Singapore Government.

Attached Files

Published - qin21a.pdf

Accepted Version - 2106.12764.pdf

Supplemental Material - qin21a-supp.pdf

Files

qin21a.pdf
Files (7.8 MB)
Name Size Download all
md5:b983532de47b2c50fccc0aa74eec42e8
2.6 MB Preview Download
md5:0487568885880099b8ff48ba7699f192
1.3 MB Preview Download
md5:161dbce22e2505bf130a2f158b73cf22
3.8 MB Preview Download

Additional details

Created:
August 20, 2023
Modified:
October 24, 2023