Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published December 20, 2022 | Submitted
Report Open

R-CNNs for Pose Estimation and Action Detection

Abstract

We present convolutional neural networks for the tasks of keypoint (pose) prediction and action classification of people in unconstrained images. Our approach involves training an R-CNN detector with loss functions depending on the task being tackled. We evaluate our method on the challenging PASCAL VOC dataset and compare it to previous leading approaches. Our method gives state-of-the-art results for keypoint and action prediction. Additionally, we introduce a new dataset for action detection, the task of simultaneously localizing people and classifying their actions, and present results using our approach.

Additional Information

This work was supported by the Intel Visual Computing Center, ONR SMARTS MURI N000140911051, ONR MURI N000141010933, a Google Research Grant and a Microsoft Research fellowship. The GPUs used in this research were generously donated by the NVIDIA Corporation.

Attached Files

Submitted - 1406.5212.pdf

Files

1406.5212.pdf
Files (10.5 MB)
Name Size Download all
md5:9ea1a76cdd7c9bd071cd04824ab671ae
10.5 MB Preview Download

Additional details

Created:
August 20, 2023
Modified:
October 24, 2023