Paper Announcement
    Marco Wiering 
    marco at idsia.ch
       
    Thu Nov 27 04:58:58 EST 1997
    
    
  
          
                      HQ-LEARNING                    
        Adaptive Behavior 6:2, 1997 (in press)        
 
 Marco Wiering                     Juergen Schmidhuber
 marco at idsia.ch                       juergen at idsia.ch
 
 IDSIA,  Corso  Elvezia 36,  6900 Lugano,  Switzerland
 HQ-learning is a hierarchical extension of Q(lambda)- 
 learning designed to solve certain types of partially 
 observable  Markov  decision  problems  (POMDPs).  HQ 
 automatically  decomposes  POMDPs  into  sequences of 
 simpler subtasks  that  can be solved  by  memoryless 
 policies learnable  by reactive subagents.  HQ solves 
 partially observable mazes with more states than used 
 in most previous POMDP work.
 FTP-host:                                ftp.idsia.ch 
 FTP-files:               /pub/marco/HQ-LEARNING.ps.gz
 
 http://www.idsia.ch/~marco/publications.html
 http://www.idsia.ch/~juergen/onlinepub.html
 Marco & Juergen, IDSIA                   www.idsia.ch
    
    
More information about the Connectionists
mailing list