Learning viewpoint invariant object representations using a temporal coherence principle
Abstract
Invariant object recognition is arguably one of the major challenges for contemporary machine vision systems. In contrast, the mammalian visual system performs this task virtually effortlessly. How can we exploit our knowledge on the biological system to improve artificial systems? Our understanding of the mammalian early visual system has been augmented by the discovery that general coding principles could explain many aspects of neuronal response properties. How can such schemes be transferred to system level performance? In the present study we train cells on a particular variant of the general principle of temporal coherence, the "stability" objective. These cells are trained on unlabeled real-world images without a teaching signal. We show that after training, the cells form a representation that is largely independent of the viewpoint from which the stimulus is looked at. This finding includes generalization to previously unseen viewpoints. The achieved representation is better suited for view-point invariant object classification than the cells' input patterns. This property to facilitate view-point invariant classification is maintained even if training and classification take place in the presence of an – also unlabeled – distractor object. In summary, here we show that unsupervised learning using a general coding principle facilitates the classification of real-world objects, that are not segmented from the background and undergo complex, non-isomorphic, transformations.
Additional Information
Received: 19 February 2004 / Accepted: 23 May 2005 / Published online: 13 July 2005 © Springer-Verlag 2005. This work was supported by the EU-AMOUSE (JH) project and the Swiss National Science Foundation (PK, grant-no. 31-61415.01). We are grateful to K.P. Kording for making his MATLAB code for optimizing the stability objective available.Attached Files
Published - Learning_viewpoint_invariant_object.pdf
Files
Name | Size | Download all |
---|---|---|
md5:3dc2e1c411efd32afcbd905d7531943b
|
562.9 kB | Preview Download |
Additional details
- Eprint ID
- 40365
- Resolver ID
- CaltechAUTHORS:20130816-103140484
- EU-AMOUSE
- Swiss National Science Foundation
- 31-61415.01
- Created
-
2008-01-12Created from EPrint's datestamp field
- Updated
-
2023-09-26Created from EPrint's last_modified field
- Caltech groups
- Koch Laboratory (KLAB)