Learning object categories from Google's image search
- Creators
- Fergus, R.
- Fei-Fei, L
-
Perona, P.
- Zisserman, A.
Abstract
Current approaches to object category recognition require datasets of training images to be manually prepared, with varying degrees of supervision. We present an approach that can learn an object category from just its name, by utilizing the raw output of image search engines available on the Internet. We develop a new model, TSI-pLSA, which extends pLSA (as applied to visual words) to include spatial information in a translation and scale invariant manner. Our approach can handle the high intra-class variability and large proportion of unrelated images returned by search engines. We evaluate the models on standard test sets, showing performance competitive with existing methods trained on hand prepared datasets.
Additional Information
© 2005 IEEE. Financial support was provided by: EC Project CogViSys; UK EPSRC; Caltech CNSE and the NSF. This work was supported in part by the IST Programme of the European Community, under the PASCAL Network of Excellence, IST-2002-506778. This publication only reflects the au thors' views. Thanks to Rebecca Hoath and Veronica Robles for image labelling. We are indebted to Josef Sivic for his considerable help with many aspects of the paper.Attached Files
Published - 01544937.pdf
Accepted Version - fergus_li_perona_azisser05.pdf
Files
Name | Size | Download all |
---|---|---|
md5:55f2fad137b979822d8affa42549245d
|
2.3 MB | Preview Download |
md5:77f776ce9b43cbf60b2029bc8dbf585e
|
1.9 MB | Preview Download |
Additional details
- Eprint ID
- 60079
- Resolver ID
- CaltechAUTHORS:20150904-125520711
- EC Project CogViSys
- Engineering and Physical Sciences Research Council (EPSRC)
- Caltech CNSE
- NSF
- PASCAL Network of Excellence
- IST-2002-506778
- Created
-
2015-09-15Created from EPrint's datestamp field
- Updated
-
2021-11-10Created from EPrint's last_modified field