Rules of the Road: Predicting Driving Behavior With a Convolutional Model of Semantic Interactions

Creators: Hong, Joey; Sapp, Benjamin; Philbin, James

Abstract

We focus on the problem of predicting future states of entities in complex, real-world driving scenarios. Previous research has approached this problem via low-level signals to predict short time horizons, and has not addressed how to leverage key assets relied upon heavily by industry self-driving systems: (1) large 3D perception efforts which provide highly accurate 3D states of agents with rich attributes, and (2) detailed and accurate semantic maps of the environment (lanes, traffic lights, crosswalks, etc). We present a unified representation which encodes such high-level semantic information in a spatial grid, allowing the use of deep convolutional models to fuse complex scene context. This enables learning entity-entity and entity-environment interactions with simple, feed-forward computations in each timestep within an overall temporal model of an agent's behavior. We propose different ways of modelling the future as a distribution over future states using standard supervised learning. We introduce a novel dataset providing industry-grade rich perception and semantic inputs, and empirically show we can effectively learn fundamentals of driving behavior.

Additional Information

© 2019 IEEE. The authors would also like to thank and acknowledge Kai Wang for his work on this project. Kai has been instrumental in coordinating the final version of the paper and preparing the dataset for release.

Attached Files

Accepted Version - 1906.08945.pdf

Files

1906.08945.pdf

Files (1.1 MB)

Name	Size	Download all
1906.08945.pdf md5:04b7ce38ec42d9c96726960af0270616	1.1 MB	Preview Download

Additional details

	All versions	This version
Views	0	0
Downloads	0	0
Data volume	0 Bytes	0 Bytes