On the Optimality of Spatial Attention for Object Detection
- Creators
- Harel, Jonathan
-
Koch, Christof
Abstract
Studies on visual attention traditionally focus on its physiological and psychophysical nature [16,18,19], or its algorithmic applications [1,9,21]. We here develop a simple, formal mathematical model of the advantage of spatial attention for object detection, in which spatial attention is defined as processing a subset of the visual input, and detection is an abstraction with certain failure characteristics. We demonstrate that it is suboptimal to process the entire visual input given prior information about target locations, which in practice is almost always available in a video setting due to tracking, motion, or saliency. This argues for an attentional strategy independent of computational savings: no matter how much computational power is available, it is in principle better to dedicate it preferentially to selected portions of the scene. This suggests, anecdotally, a form of environmental pressure for the evolution of foveated photoreceptor densities in the retina. It also offers a general justification for the use of spatial attention in machine vision.
Additional Information
© 2009 Springer-Verlag Berlin Heidelberg. We wish to thank DARPA for its generous support of a research program for the development of a biologically modeled object recognition system, and our close collaborators on that program, Sharat Chikkerur at MIT, and Rob Peters at USC.Additional details
- Eprint ID
- 18819
- DOI
- 10.1007/978-3-642-00582-4_1
- Resolver ID
- CaltechAUTHORS:20100625-153044964
- Defense Advanced Research Projects Agency (DARPA)
- Created
-
2010-08-02Created from EPrint's datestamp field
- Updated
-
2021-11-08Created from EPrint's last_modified field
- Caltech groups
- Koch Laboratory (KLAB)
- Series Name
- Lecture Notes in Artificial Intelligence
- Series Volume or Issue Number
- 5395