Paper Announcement
Marco Wiering
marco at idsia.ch
Thu Nov 27 04:58:58 EST 1997
HQ-LEARNING
Adaptive Behavior 6:2, 1997 (in press)
Marco Wiering          Juergen Schmidhuber
marco at idsia.ch      juergen at idsia.ch
IDSIA, Corso Elvezia 36, 6900 Lugano, Switzerland
HQ-learning is a hierarchical extension of Q(lambda)-
learning designed to solve certain types of partially
observable Markov decision problems (POMDPs). HQ
automatically decomposes POMDPs into sequences of
simpler subtasks that can be solved by memoryless
policies learnable by reactive subagents. HQ solves
partially observable mazes with more states than used
in most previous POMDP work.
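
The decomposition idea can be illustrated with a toy sketch. This is not the HQ-learning algorithm from the paper (there is no learning here, and the subgoals are fixed by hand); it only shows why a sequence of memoryless policies with control handoff can succeed where one memoryless policy cannot. The corridor world, the observation names "L", "C", "R", and the functions observe, step, and run are all hypothetical, introduced for illustration.

```python
from itertools import product

# Hypothetical 1-D corridor with states 0..4. The three middle cells
# all produce the same observation "C", so the task is partially observable.
def observe(state):
    if state == 0:
        return "L"
    if state == 4:
        return "R"
    return "C"

def step(state, action):
    # Move one cell left or right, clipped to the corridor ends.
    return max(0, min(4, state + (1 if action == "right" else -1)))

def run(agents, start=2, max_steps=20):
    """Task: visit the right end first, then reach the left end.

    agents is a list of (policy, subgoal) pairs, where policy maps an
    observation to an action (memoryless) and subgoal is the observation
    at which control passes to the next agent in the sequence.
    """
    state, active, visited_right = start, 0, False
    for _ in range(max_steps):
        obs = observe(state)
        if obs == "R":
            visited_right = True
        if visited_right and obs == "L":
            return True
        policy, subgoal = agents[active]
        if obs == subgoal and active < len(agents) - 1:
            active += 1  # hand control to the next reactive subagent
            policy, subgoal = agents[active]
        state = step(state, policy[obs])
    return False

# Every possible single memoryless policy over the three observations.
single_results = [
    run([(dict(zip("LCR", actions)), None)])
    for actions in product(["left", "right"], repeat=3)
]

# Two memoryless subagents: go right until "R" is seen, then go left.
go_right = {"L": "right", "C": "right", "R": "right"}
go_left = {"L": "left", "C": "left", "R": "left"}
sequence_result = run([(go_right, "R"), (go_left, "L")])
```

In this toy setting no single observation-to-action mapping solves the task, because the aliased observation "C" requires "right" before the right end is visited and "left" afterwards. Splitting the task at the subgoal observation "R" makes each half solvable by a reactive policy.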
FTP-host: ftp.idsia.ch
FTP-files: /pub/marco/HQ-LEARNING.ps.gz
http://www.idsia.ch/~marco/publications.html
http://www.idsia.ch/~juergen/onlinepub.html
Marco & Juergen, IDSIA www.idsia.ch