Paper announcement !!!!!!!

Marco Wiering marco at idsia.ch
Tue Apr 7 08:58:29 EDT 1998



                       Fast Online Q(lambda)        

             Marco Wiering         Juergen Schmidhuber

             To appear in the Machine Learning Journal             

Q(lambda)-learning uses TD(lambda)-methods to accelerate Q-learning. 
The update  complexity of previous  online Q(lambda) implementations 
based on  lookup-tables is bounded  by the size of the  state/action 
space.  Our faster algorithm's update  complexity is  bounded by the 
number  of actions.  The  method  is based on the  observation  that 
Q-value updates may be postponed until they are needed.

Also to be presented at the 10th European Conference On Machine 
Learning (ECML'98), Chemnitz (Germany), April 21-24 1998. 

FTP-host:                                               ftp.idsia.ch
FTP-files:                                   /pub/marco/fast_q.ps.gz  
                                             /pub/marco/ecml_q.ps.gz  
WWW:                    http://www.idsia.ch/~marco/publications.html               
                         http://www.idsia.ch/~juergen/onlinepub.html               

Marco & Juergen                                   IDSIA, Switzerland



More information about the Connectionists mailing list