Neural Tree Expansion for Multi-Robot Planning in Non-Cooperative Environments

Creators: Rivière, Benjamin; Hoenig, Wolfgang; Anderson, Matthew; Chung, Soon-Jo

Style

An error occurred while generating the citation.

Abstract

We present a self-improving, Neural Tree Expansion (NTE) method for multi-robot online planning in non-cooperative environments, where each robot attempts to maximize its cumulative reward while interacting with other self-interested robots. Our algorithm adapts the centralized, perfect information, discrete-action space method from AlphaZero to a decentralized, partial information, continuous action space setting for multi-robot applications. Our method has three interacting components: (i) a centralized, perfect-information "expert" Monte Carlo Tree Search (MCTS) with large computation resources that provides expert demonstrations, (ii) a decentralized, partial-information "learner" MCTS with small computation resources that runs in real-time and provides self-play examples, and (iii) policy & value neural networks that are trained with the expert demonstrations and bias both the expert and the learner tree growth. Our numerical experiments demonstrate Neural Tree Expansion's computational advantage by finding better solutions than a MCTS with 20 times more resources. The resulting policies are dynamically sophisticated, demonstrate coordination between robots, and play the Reach-Target-Avoid differential game significantly better than the state-of-the-art control-theoretic baseline for multi-robot, double-integrator systems. Our hardware experiments on an aerial swarm demonstrate the computational advantage of Neural Tree Expansion, enabling online planning at 20 Hz with effective policies in complex scenarios.

Additional Information

© 2021 IEEE. Manuscript received February 24, 2021; accepted June 24, 2021. Date of publication July 14, 2021; date of current version July 26, 2021. This work was supported by the Defense Advanced Research Projects Agency (DARPA). The views, opinions and/or findings expressed are those of the authors, and should not be interpreted as representing the official views or policies of the Department of Defense or the U.S. Government. Preliminary work was in part funded by Raytheon. Video: https://youtu.be/mklbTfWl7DE. Code: https://github.com/bpriviere/decision_making.

Attached Files

Submitted - 2104.09705.pdf

Files

2104.09705.pdf

Files (1.5 MB)

Name	Size	Download all
2104.09705.pdf md5:62cf425197dd049f751235498e2c58fb	1.5 MB	Preview Download

Additional details

	All versions	This version
Views	0	0
Downloads	0	0
Data volume	0 Bytes	0 Bytes