OCEAN: Online Task Inference for Compositional Tasks with Context Adaptation
Abstract
Real-world tasks often exhibit a compositional structure that contains a sequence of simpler sub-tasks. For instance, opening a door requires reaching, grasping, rotating, and pulling the door knob. Such compositional tasks require an agent to reason about the sub-task at hand while orchestrating global behavior accordingly. This can be cast as an online task inference problem, where the current task identity, represented by a context variable, is estimated from the agent's past experiences with probabilistic inference. Previous approaches have employed simple latent distributions, e.g., Gaussian, to model a single context for the entire task. However, this formulation lacks the expressiveness to capture the composition and transition of the sub-tasks. We propose a variational inference framework OCEAN to perform online task inference for compositional tasks. OCEAN models global and local context variables in a joint latent space, where the global variables represent a mixture of sub-tasks required for the task, while the local variables capture the transitions between the sub-tasks. Our framework supports flexible latent distributions based on prior knowledge of the task structure and can be trained in an unsupervised manner. Experimental results show that OCEAN provides more effective task inference with sequential context adaptation and thus leads to a performance boost on complex, multi-stage tasks.
Additional Information
© The authors and PMLR 2020. A.G. is a CIFAR AI chair and also acknowledges Vector Institute for computing support. J. L. is a Chan Zuckerberg Biohub investigator. We gratefully acknowledge the support of DARPA under Nos. FA865018C7880 (ASED), N660011924033 (MCS); ARO under Nos. W911NF-16-1-0342 (MURI), W911NF-16-1-0171 (DURIP); NSF under Nos. OAC-1835598 (CINES), OAC-1934578 (HDR), CCF-1918940 (Expeditions), IIS-2030477 (RAPID); Stanford Data Science Initiative, Wu Tsai Neurosciences Institute, Chan Zuckerberg Biohub, Amazon, Boeing, Chase, Docomo, Hitachi, Huawei, JD.com, NVIDIA, Dell. The U.S. Government is authorized to reproduce and distribute reprints for Governmental purposes notwithstanding any copyright notation thereon. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views, policies, or endorsements, either expressed or implied, of DARPA, NIH, ARO, or the U.S. Government.Attached Files
Published - ren20a.pdf
Accepted Version - 2008.07087.pdf
Supplemental Material - ren20a-supp.pdf
Files
Additional details
- Eprint ID
- 106483
- Resolver ID
- CaltechAUTHORS:20201106-120151731
- Canadian Institute for Advanced Research (CIFAR)
- Chan-Zuckerberg Biohub
- Defense Advanced Research Projects Agency (DARPA)
- FA865018C7880
- Defense Advanced Research Projects Agency (DARPA)
- N660011924033
- Army Research Office (ARO)
- W911NF-16-1-0342
- Army Research Office (ARO)
- W911NF-16-1-0171
- NSF
- OAC-1835598
- NSF
- OAC-1934578
- NSF
- CCF-1918940
- NSF
- IIS-2030477
- Stanford University
- Wu Tsai Neurosciences Institute
- Amazon
- Boeing Corporation
- Chase Manhattan Bank
- Docomo
- Hitachi
- Huawei
- JD.com
- NVIDIA Corporation
- Dell Inc.
- Created
-
2020-11-06Created from EPrint's datestamp field
- Updated
-
2023-06-02Created from EPrint's last_modified field