OCEAN: Online Task Inference for Compositional Tasks with Context Adaptation

Creators: Ren, Hongyu; Zhu, Yuke; Leskovec, Jure; Anandkumar, Anima; Garg, Animesh

Abstract

Real-world tasks often exhibit a compositional structure: they consist of a sequence of simpler sub-tasks. For instance, opening a door requires reaching, grasping, rotating, and pulling the door knob. Such compositional tasks require an agent to reason about the sub-task at hand while orchestrating global behavior accordingly. This can be cast as an online task inference problem, where the current task identity, represented by a context variable, is estimated from the agent's past experiences by probabilistic inference. Previous approaches have employed simple latent distributions, e.g., Gaussian, to model a single context for the entire task. However, this formulation lacks the expressiveness to capture the composition of sub-tasks and the transitions between them. We propose a variational inference framework, OCEAN, to perform online task inference for compositional tasks. OCEAN models global and local context variables in a joint latent space, where the global variables represent a mixture of sub-tasks required for the task, while the local variables capture the transitions between the sub-tasks. Our framework supports flexible latent distributions based on prior knowledge of the task structure and can be trained in an unsupervised manner. Experimental results show that OCEAN provides more effective task inference with sequential context adaptation and thus leads to a performance boost on complex, multi-stage tasks.
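
As a toy illustration of what online inference over sub-tasks can look like, the sketch below runs a discrete Bayes filter over the four sub-tasks of the door example. It is not OCEAN's model, which uses learned variational posteriors over global and local context variables. The left-to-right transition structure, the `stay` probability, and the per-step likelihood values are all assumptions made up for illustration.

```python
# Toy discrete Bayes filter over sub-task identity (illustrative only).
# All names and numbers here are hypothetical, not taken from the paper.

SUBTASKS = ["reach", "grasp", "rotate", "pull"]


def make_transition(stay=0.8):
    """Left-to-right transitions: stay in a sub-task or advance to the next."""
    n = len(SUBTASKS)
    T = [[0.0] * n for _ in range(n)]
    for i in range(n):
        if i == n - 1:
            T[i][i] = 1.0  # the last sub-task is absorbing
        else:
            T[i][i] = stay
            T[i][i + 1] = 1.0 - stay
    return T


def filter_step(belief, likelihood, T):
    """One predict/update step: propagate the belief, then reweight by evidence."""
    n = len(belief)
    predicted = [sum(belief[i] * T[i][j] for i in range(n)) for j in range(n)]
    unnorm = [predicted[j] * likelihood[j] for j in range(n)]
    z = sum(unnorm)
    if z == 0.0:
        raise ValueError("observation has zero probability under the model")
    return [u / z for u in unnorm]


def infer_online(likelihoods, stay=0.8):
    """Return the belief over sub-tasks after each observation, in order."""
    T = make_transition(stay)
    belief = [1.0, 0.0, 0.0, 0.0]  # assumed: the episode starts in "reach"
    history = []
    for lik in likelihoods:
        belief = filter_step(belief, lik, T)
        history.append(belief)
    return history


def most_likely(belief):
    """Name of the sub-task with the highest posterior probability."""
    return SUBTASKS[max(range(len(belief)), key=lambda j: belief[j])]


if __name__ == "__main__":
    # Hypothetical evidence: each step favors the next sub-task in sequence.
    evidence = [
        [0.7, 0.1, 0.1, 0.1],
        [0.1, 0.7, 0.1, 0.1],
        [0.1, 0.1, 0.7, 0.1],
        [0.1, 0.1, 0.1, 0.7],
    ]
    for b in infer_online(evidence):
        print(most_likely(b), [round(p, 3) for p in b])
```

The belief is updated after every observation rather than once per episode, which is the sense in which the inference is online. A single fixed context for the whole task would correspond to never applying the transition step.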

Additional Information

© The authors and PMLR 2020.

A.G. is a CIFAR AI chair and also acknowledges the Vector Institute for computing support. J.L. is a Chan Zuckerberg Biohub investigator. We gratefully acknowledge the support of DARPA under Nos. FA865018C7880 (ASED), N660011924033 (MCS); ARO under Nos. W911NF-16-1-0342 (MURI), W911NF-16-1-0171 (DURIP); NSF under Nos. OAC-1835598 (CINES), OAC-1934578 (HDR), CCF-1918940 (Expeditions), IIS-2030477 (RAPID); Stanford Data Science Initiative, Wu Tsai Neurosciences Institute, Chan Zuckerberg Biohub, Amazon, Boeing, Chase, Docomo, Hitachi, Huawei, JD.com, NVIDIA, and Dell. The U.S. Government is authorized to reproduce and distribute reprints for Governmental purposes notwithstanding any copyright notation thereon. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views, policies, or endorsements, either expressed or implied, of DARPA, NIH, ARO, or the U.S. Government.

Attached Files

Published - ren20a.pdf

Accepted Version - 2008.07087.pdf

Supplemental Material - ren20a-supp.pdf

Files (1.7 MB)

Name	MD5	Size
ren20a-supp.pdf	81e427a6ab7b921b52e058b0446bd428	37.7 kB
ren20a.pdf	9302a8a7efff02e48d0d71c6120e9ce0	824.5 kB
2008.07087.pdf	9f88fd90777df498eb73d3528d29b207	867.0 kB