paper available by ftp
mccallum@cs.rochester.edu
mccallum at cs.rochester.edu
Wed Jul 6 15:54:40 EDT 1994
FTP-host: ftp.cs.rochester.edu
FTP-file: pub/papers/robotics/94.mccallum-tr502.ps.Z
27 pages.
"First Results with Instance-Based State Identification
for Reinforcement Learning"
R. Andrew McCallum
Department of Computer Science
University of Rochester
Technical Report 502
When a reinforcement learning agent's next course of action depends on
information that is hidden from the sensors because of problems such
as occlusion, restricted range, bounded field of view and limited
attention, we say the agent suffers from the Hidden State Problem.
State identification techniques use history information to uncover
hidden state. Previous approaches to encoding history include: finite
state machines [Chrisman 1992; McCallum 1992], recurrent neural
networks [Lin 1992] and genetic programming with indexed memory
[Teller 1994]. A chief disadvantage of all these techniques is their
long training time.
This report presents Instance-Based State Identification, a new
approach to reinforcement learning with state identification that
learns with much fewer training steps. Noting that learning with
history and learning in continuous spaces both share the property that
they begin without knowing the granularity of the state space, the
approach applies instance-based (or ``memory-based'') learning to
history sequences---instead of recording instances in a continuous
geometrical space, we record instances in action-perception-reward
sequence space.
The first implementation of this approach, called Nearest Sequence
Memory, learns with an order of magnitude fewer steps than several
previous approaches.
The paper is also available through the http URL below:
R. Andrew McCallum EBOX: mccallum at cs.rochester.edu
Computer Science Dept VOX: (716) 275-2527, (716) 275-1372 (lab)
University of Rochester FAX: (716) 461-2018
Rochester, NY 14627-0226 http://www.cs.rochester.edu/u/mccallum
------- End of Blind-Carbon-Copy
More information about the Connectionists
mailing list