Describing Common Human Visual Actions in Images

Creators: Ronchi, Matteo Ruggero; Perona, Pietro

Others:: Xie, Xianghua; Jones, Mark W.; Tam, Gary K. L.

Style

An error occurred while generating the citation.

Abstract

Which common human actions and interactions are recognizable in monocular still images? Which involve objects and/or other people? How many is a person performing at a time? We address these questions by exploring the actions and interactions that are detectable in the images of the MS COCO dataset. We make two main contributions. First, a list of 140 common 'visual actions', obtained by analyzing the largest online verb lexicon currently available for English (VerbNet) and human sentences used to describe images in MS COCO. Second, a complete set of annotations for those 'visual actions', composed of subject-object and associated verb, which we call COCO-a (a for 'actions'). COCO-a is larger than existing action datasets in terms of number instances of actions, and is unique because it is data-driven, rather than experimenter-biased. Other unique features are that it is exhaustive, and that all subjects and objects are localized. A statistical analysis of the accuracy of our annotations and of each action, interaction and subject-object combination is provided.

Additional Information

Attached Files

Published - BMVC15_DescribingCommonVisualActions_PAPER.pdf

Submitted - 1506.02203.pdf

Supplemental Material - BMVC15_DescribingCommonVisualActions_SUPP.pdf

Supplemental Material - sup052.zip

Files

sup052.zip

Files (18.5 MB)

Name	Size	Download all
sup052.zip md5:2167e30007c9b501e4441ec0efc29ca9	1.3 MB	Preview Download
1506.02203.pdf md5:2586049555f6cbfb36942e54f2209c1e	9.0 MB	Preview Download
BMVC15_DescribingCommonVisualActions_PAPER.pdf md5:968e05182886336c3e1d098f5ce66ad4	6.8 MB	Preview Download
BMVC15_DescribingCommonVisualActions_SUPP.pdf md5:7624522c76b3c545cbfbeee83493d8f6	1.4 MB	Preview Download

Additional details

Views

Downloads

	All versions	This version
Views	0	0
Downloads	0	0
Data volume	0 Bytes	0 Bytes

More info on how stats are collected....

Resource type: Book Section - Chapter
Publisher: BMVA Press
Imprint: British Machine Vision Conference 2015, 52.1-52.12. Durham, UK. ISBN: 1901725537.
Conference: British Machine Vision Conference (BMVC 2015) , Swansea, UK, 7-10 September 2015