Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published June 2000 | public
Book Section - Chapter

Towards Automatic Discovery of Object Categories

Abstract

We propose a method to learn heterogeneous models of object classes for visual recognition. The training images contain a preponderance of clutter and learning is unsupervised. Our models represent objects as probabilistic constellations of rigid parts (features). The variability within a class is represented by a joint probability density function on the shape of the constellation and the appearance of the parts. Our method automatically identifies distinctive features in the training set. The set of model parameters is then learned using expectation maximization (see the companion paper [11] for details). When trained on different, unlabeled and unsegmented views of a class of objects, each component of the mixture model can adapt to represent a subset of the views. Similarly, different component models can also "specialize" on sub-classes of an object class. Experiments on images of human heads, leaves from different species of trees, and motor-cars demonstrate that the method works well over a wide variety of objects.

Additional Information

© 2000 IEEE. Date of Current Version: 06 August 2002. Funded by the NSF Engineering Research Center for Neuromorphic Systems Engineering (CNSE) at Caltech (NSF9402726), and an NSF National Young Investigator Award to P.P. (NSF9457618). M. Welling was supported by the Sloan Foundation. We are also very grateful to Rob Fergus, Catharine Stebbins and Justin Smith for helping with collecting the databases. We are grateful to Thomas Leung, Mike Burl, Jitendra Malik and David Forsyth for many helpful comments.

Additional details

Created:
August 19, 2023
Modified:
January 13, 2024