Tech Report: Eye movements - a computational study
Rajesh Rao
rao at cs.rochester.edu
Mon Mar 17 22:34:36 EST 1997
The following report describing a computational model of eye movements
in visual cognition is available for retrieval via ftp.
Keywords: Saccades, spatiochromatic filters, saliency maps, spatial
memory, object-centered maps, reference frames
Comments and suggestions welcome (This message has been cross-posted -
my apologies to those who received it more than once).
--
Rajesh Rao Internet: rao at cs.rochester.edu
Dept. of Computer Science VOX: (716) 275-2527
University of Rochester FAX: (716) 461-2018
Rochester NY 14627-0226 WWW: http://www.cs.rochester.edu/u/rao/
===========================================================================
Eye Movements in Visual Cognition:
A Computational Study
Rajesh P.N. Rao, Gregory J. Zelinsky,
Mary M. Hayhoe, and Dana H. Ballard
Technical Report 97.1
National Resource Laboratory for the Study of Brain and Behavior
University of Rochester
March 1997
Abstract
Visual cognition depends critically on the moment-to-moment
orientation of gaze. Gaze is changed by saccades, rapid eye
movements that orient the fovea over targets of interest in a visual
scene. Saccades are ballistic; a prespecified target location is
computed prior to the movement and visual feedback is precluded.
Once a target is fixated, gaze is typically held for about 300
milliseconds, although it can be held for both longer and shorter
intervals. Despite these distinctive properties, there has been no
specific computational model of the gaze targeting strategy employed
by the human visual system during visual cognitive tasks. This
paper proposes such a model that uses iconic scene representations
derived from oriented spatiochromatic filters at multiple
scales. Visual search for a target object proceeds in a
coarse-to-fine fashion with the target's largest scale filter
responses being compared first. Task-relevant target locations are
represented as saliency maps which are used to program eye
movements. Once fixated, targets are remembered by using spatial
memory in the form of object-centered maps. The model was
empirically tested by comparing its performance with actual eye
movement data from human subjects in natural visual search tasks.
Experimental results indicate excellent agreement between eye
movements predicted by the model and those recorded from human
subjects.
Retrieval information:
FTP-host: ftp.cs.rochester.edu
FTP-pathname: /pub/u/rao/papers/tr97.1.ps.Z
URL: ftp://ftp.cs.rochester.edu/pub/u/rao/papers/tr97.1.ps.Z
35 pages; 1385K compressed, 6667K uncompressed
-------------------------------------------------------------------------
Anonymous ftp instructions:
>ftp ftp.cs.rochester.edu
Connected to anon.cs.rochester.edu.
220 anon.cs.rochester.edu FTP server (Version wu-2.4(3)) ready.
Name: [type 'anonymous' here]
331 Guest login ok, send your complete e-mail address as password.
Password: [type your e-mail address here]
ftp> cd /pub/u/rao/papers/
ftp> get tr97.1.ps.Z
ftp> bye
>uncompress tr97.1.ps.Z
>lpr tr97.1.ps
More information about the Connectionists
mailing list