Target detection using saliency-based attention
- Creators
- Itti, Laurent
-
Koch, Christof
Abstract
Most models of visual search, whether involving overt eye movements or covert shifts of attention, are based on the concept of a "saliency map", that is, an explicit two-dimensional map that encodes the saliency or conspicuity of objects in the visual environment. Competition among neurons in this map gives rise to a single winning location that corresponds to the next attended target. Inhibiting this location automatically allows the system to attend to the next most salient location. We describe a detailed computer implementation of such a scheme, focusing on the problem of combining information across modalities, here orientation, intensity and color information, in a purely stimulus-driven manner. We have successfully applied this model to a wide range of target detection tasks, using synthetic and natural stimuli. Performance has however remained difficult to objectively evaluate on natural scenes, because no objective reference was available for comparison. We here present predicted search times for our model on the Search2 database of rural scenes containing a military vehicle. Overall, we found a poor correlation between human and model search times. Further analysis however revealed that in 3/4 of the images, the model appeared to detect the target faster than humans (for comparison, we calibrated the model's arbitrary internal time frame such that no more than 2-4 image locations were visited per second). It hence seems that this model, which had originally been designed not to find small, hidden military vehicles, but rather to find the few most obviously conspicuous objects in an image, performed as an efficient target detector on the Search2 dataset.
Additional Information
Copyright © RTO/NATO 2000. Papers presented at the RTO Workshop organised by the Systems Concepts and Integration Panel (SCI), held in Utrecht, The Netherlands, 21-23 June 1999. We thank Dr. A. Toet from TNO-HFRI for providing us with the search2 dataset and all human data. This work was supported by NSF (Caltech ERC), NIMH, ONR and NATO.Attached Files
Published - Koch-pp44-53-MP-045-$$ALL.pdf
Published - MP-045-$$ALL.PDF
Accepted Version - 333.pdf
Files
Additional details
- Eprint ID
- 40573
- Resolver ID
- CaltechAUTHORS:20130816-103250017
- NSF ERC
- NIMH
- U.S. Office of Naval Research
- NATO
- Created
-
2008-01-16Created from EPrint's datestamp field
- Updated
-
2020-03-03Created from EPrint's last_modified field
- Caltech groups
- Koch Laboratory (KLAB)
- Series Name
- RTO Meeting Proceedings
- Series Volume or Issue Number
- 45
- Other Numbering System Name
- AC/323(SCI)TP
- Other Numbering System Identifier
- 19