Paper Announcement

Marco Wiering marco at idsia.ch
Thu Nov 27 04:58:58 EST 1997


          

                      HQ-LEARNING                    

        Adaptive Behavior 6:2, 1997 (in press)        
 
 Marco Wiering                     Juergen Schmidhuber
 marco at idsia.ch                       juergen at idsia.ch
 
 IDSIA,  Corso  Elvezia 36,  6900 Lugano,  Switzerland

 HQ-learning is a hierarchical extension of Q(lambda)- 
 learning designed to solve certain types of partially 
 observable  Markov  decision  problems  (POMDPs).  HQ 
 automatically  decomposes  POMDPs  into  sequences of 
 simpler subtasks  that  can be solved  by  memoryless 
 policies learnable  by reactive subagents.  HQ solves 
 partially observable mazes with more states than used 
 in most previous POMDP work.

 FTP-host:                                ftp.idsia.ch 
 FTP-files:               /pub/marco/HQ-LEARNING.ps.gz
 
 http://www.idsia.ch/~marco/publications.html
 http://www.idsia.ch/~juergen/onlinepub.html

 Marco & Juergen, IDSIA                   www.idsia.ch


