Learning to Control an Unstable System with One Minute of Data: Leveraging Gaussian Process Differentiation in Predictive Control
Abstract
We present a straightforward and efficient way to control unstable robotic systems using an estimated dynamics model. Specifically, we show how to exploit the differentiability of Gaussian Processes to create a state-dependent linearized approximation of the true continuous dynamics that can be integrated with model predictive control. Our approach is compatible with most Gaussian process approaches for system identification, and can learn an accurate model using modest amounts of training data. We validate our approach by learning the dynamics of an unstable system such as a segway with a 7-D state space and 2-D input space (using only one minute of data), and we show that the resulting controller is robust to unmodelled dynamics and disturbances, while state-of-the-art control methods based on nominal models can fail under small perturbations. Code is open sourced at https://github.com/learning-and-control/core.
Additional Information
© 2021 IEEE. This work was supported by NSF awards 1637598, 1645832, 1932091, 1924526, and 1923239, and funding from AeroVironment, JPL and BMW. We would like to thank Andrew Singletary and Ellen Novoseller for their help in setting up the robotics and the mathematical derivations, respectively. We would also like to thank the follwing software packages: PyTorch [43], CVXPY [44] and Gurobi [45].Attached Files
Submitted - 2103.04548.pdf
Files
Name | Size | Download all |
---|---|---|
md5:66ae6cb2565568115fdce53255df71d8
|
3.6 MB | Preview Download |
Additional details
- Alternative title
- Learning Unstable Dynamics with One Minute of Data: A Differentiation-based Gaussian Process Approach
- Eprint ID
- 109029
- Resolver ID
- CaltechAUTHORS:20210510-095024771
- NSF
- CCF-1637598
- NSF
- CNS-1645832
- NSF
- CNS-1932091
- NSF
- ECCS-1924526
- NSF
- CMMI-1923239
- AeroVironment
- JPL
- BMW
- Created
-
2021-05-10Created from EPrint's datestamp field
- Updated
-
2022-04-04Created from EPrint's last_modified field