Neural Modular Control for Embodied Question Answering
Abstract
We present a modular approach for learning policies for navigation over long planning horizons from language input. Our hierarchical policy operates at multiple timescales, where the higher-level master policy proposes subgoals to be executed by specialized sub-policies. Our choice of subgoals is compositional and semantic, i.e. they can be sequentially combined in arbitrary orderings, and assume human-interpretable descriptions (e.g. 'exit room', 'find kitchen', 'find refrigerator', etc.). We use imitation learning to warm-start policies at each level of the hierarchy, dramatically increasing sample efficiency, followed by reinforcement learning. Independent reinforcement learning at each level of hierarchy enables sub-policies to adapt to consequences of their actions and recover from errors. Subsequent joint hierarchical training enables the master policy to adapt to the sub-policies. On the challenging EQA (Das et al., 2018) benchmark in House3D (Wu et al., 2018), requiring navigating diverse realistic indoor environments, our approach outperforms prior work by a significant margin, both in terms of navigation and question answering.
Additional Information
This work was supported in part by NSF, AFRL, DARPA, Siemens, Google, Amazon, ONR YIPs and ONR Grants N00014-16-1-{2713,2793}. The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of the U.S. Government, or any sponsor.Attached Files
Accepted Version - 1810.11181.pdf
Files
Name | Size | Download all |
---|---|---|
md5:8eec6241f27047a5a0abc6e937bac94c
|
1.9 MB | Preview Download |
Additional details
- Eprint ID
- 118430
- Resolver ID
- CaltechAUTHORS:20221219-204822934
- NSF
- Air Force Research Laboratory (AFRL)
- Defense Advanced Research Projects Agency (DARPA)
- Siemens
- Amazon
- Office of Naval Research (ONR)
- N00014-16-1-2713
- Office of Naval Research (ONR)
- N00014-16-1-2793
- Created
-
2022-12-20Created from EPrint's datestamp field
- Updated
-
2023-06-02Created from EPrint's last_modified field