Rules of the Road: Predicting Driving Behavior With a Convolutional Model of Semantic Interactions
- Creators
- Hong, Joey
- Sapp, Benjamin
- Philbin, James
Abstract
We focus on the problem of predicting future states of entities in complex, real-world driving scenarios. Previous research has approached this problem via low-level signals to predict short time horizons, and has not addressed how to leverage key assets relied upon heavily by industry self-driving systems: (1) large 3D perception efforts which provide highly accurate 3D states of agents with rich attributes, and (2) detailed and accurate semantic maps of the environment (lanes, traffic lights, crosswalks, etc). We present a unified representation which encodes such high-level semantic information in a spatial grid, allowing the use of deep convolutional models to fuse complex scene context. This enables learning entity-entity and entity-environment interactions with simple, feed-forward computations in each timestep within an overall temporal model of an agent's behavior. We propose different ways of modelling the future as a distribution over future states using standard supervised learning. We introduce a novel dataset providing industry-grade rich perception and semantic inputs, and empirically show we can effectively learn fundamentals of driving behavior.
Additional Information
© 2019 IEEE. The authors would also like to thank and acknowledge Kai Wang for his work on this project. Kai has been instrumental in coordinating the final version of the paper and preparing the dataset for release.Attached Files
Accepted Version - 1906.08945.pdf
Files
Name | Size | Download all |
---|---|---|
md5:04b7ce38ec42d9c96726960af0270616
|
1.1 MB | Preview Download |
Additional details
- Eprint ID
- 100769
- Resolver ID
- CaltechAUTHORS:20200116-142937887
- Created
-
2020-01-16Created from EPrint's datestamp field
- Updated
-
2021-11-16Created from EPrint's last_modified field