Visual Dynamics: Probabilistic Future Frame Synthesis via Cross Convolutional Networks
Abstract
We study the problem of synthesizing a number of likely future frames from a single input image. In contrast to traditional methods, which have tackled this problem in a deterministic or non-parametric way, we propose a novel approach which models future frames in a probabilistic manner. Our proposed method is therefore able to synthesize multiple possible next frames using the same model. Solving this challenging problem involves low- and high-level image and motion understanding for successful image synthesis. Here, we propose a novel network structure, namely a Cross Convolutional Network, that encodes images as feature maps and motion information as convolutional kernels to aid in synthesizing future frames. In experiments, our model performs well on both synthetic data, such as 2D shapes and animated game sprites, as well as on real-wold video data. We show that our model can also be applied to tasks such as visual analogy-making, and present analysis of the learned network representations.
Additional Information
© 2016 Neural Information Processing Systems Foundation. The authors thank Yining Wang for helpful discussions. This work is supported by NSF Robust Intelligence 1212849, NSF Big Data 1447476, ONR MURI 6923196, Adobe, and Shell Research. The authors would also like to thank Nvidia for GPU donation. The first two authors contributed equally to this work.Attached Files
Published - 6552-visual-dynamics-probabilistic-future-frame-synthesis-via-cross-convolutional-networks.pdf
Supplemental Material - 6552-visual-dynamics-probabilistic-future-frame-synthesis-via-cross-convolutional-networks-supplemental.zip
Files
Name | Size | Download all |
---|---|---|
md5:adad6f75026e10cda8dd6ae5d764f1b1
|
17.1 MB | Preview Download |
md5:ab76d40e6ba776e090e433152f991d8d
|
1.3 MB | Preview Download |
Additional details
- Eprint ID
- 94522
- Resolver ID
- CaltechAUTHORS:20190405-161634029
- NSF
- IIS-1212849
- Office of Naval Research
- 6923196
- Adobe
- Shell Research
- nVidia
- Created
-
2019-04-09Created from EPrint's datestamp field
- Updated
-
2023-06-02Created from EPrint's last_modified field