paper available by ftp

mccallum@cs.rochester.edu mccallum at cs.rochester.edu
Wed Jul 6 15:54:40 EDT 1994


FTP-host: ftp.cs.rochester.edu
FTP-file: pub/papers/robotics/94.mccallum-tr502.ps.Z
27 pages.

        "First Results with Instance-Based State Identification 
                     for Reinforcement Learning"

                         R. Andrew McCallum
                    Department of Computer Science
                      University of Rochester
                        Technical Report 502

When a reinforcement learning agent's next course of action depends on
information that is hidden from the sensors because of problems such
as occlusion, restricted range, bounded field of view and limited
attention, we say the agent suffers from the Hidden State Problem.
State identification techniques use history information to uncover
hidden state.  Previous approaches to encoding history include: finite
state machines [Chrisman 1992; McCallum 1992], recurrent neural
networks [Lin 1992] and genetic programming with indexed memory
[Teller 1994].  A chief disadvantage of all these techniques is their
long training time.
  
This report presents Instance-Based State Identification, a new
approach to reinforcement learning with state identification that
learns with much fewer training steps.  Noting that learning with
history and learning in continuous spaces both share the property that
they begin without knowing the granularity of the state space, the
approach applies instance-based (or ``memory-based'') learning to
history sequences---instead of recording instances in a continuous
geometrical space, we record instances in action-perception-reward
sequence space.

The first implementation of this approach, called Nearest Sequence
Memory, learns with an order of magnitude fewer steps than several
previous approaches.


The paper is also available through the http URL below:

R. Andrew McCallum          EBOX: mccallum at cs.rochester.edu
Computer Science Dept       VOX: (716) 275-2527, (716) 275-1372 (lab)
University of Rochester     FAX: (716) 461-2018
Rochester, NY  14627-0226   http://www.cs.rochester.edu/u/mccallum

------- End of Blind-Carbon-Copy


More information about the Connectionists mailing list