Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published September 2012 | public
Journal Article

Unsupervised Learning of Categorical Segments in Image Collections

Abstract

Which one comes first: segmentation or recognition? We propose a unified framework for carrying out the two simultaneously and without supervision. The framework combines a flexible probabilistic model, for representing the shape and appearance of each segment, with the popular "bag of visual words" model for recognition. If applied to a collection of images, our framework can simultaneously discover the segments of each image and the correspondence between such segments, without supervision. Such recurring segments may be thought of as the "parts" of corresponding objects that appear multiple times in the image collection. Thus, the model may be used for learning new categories, detecting/classifying objects, and segmenting images, without using expensive human annotation.

Additional Information

© 2012 IEEE. Manuscript received 12 Dec. 2009; revised 23 Nov. 2010; accepted 22 Nov. 2011; published online 22 Dec. 2011. Recommended for acceptance by S. Belongie. Date of Publication: 27 December 2011. Date of Current Version: 23 July 2012. The authors would like to thank Greg Griffin and Kristin Branson for reviewing the manuscript and giving many important suggestions for improving it. Funding for this research was provided by ONR-MURI Grant N00014-06-1-0734. Lihi Zelnik-Manor is supported by FP7-IRG grant 2009783.

Additional details

Created:
August 19, 2023
Modified:
October 18, 2023