Paper announcement !!!!!!!
Marco Wiering
marco at idsia.ch
Tue Apr 7 08:58:29 EDT 1998
Fast Online Q(lambda)
Marco Wiering Juergen Schmidhuber
To appear in the Machine Learning Journal
Q(lambda)-learning uses TD(lambda)-methods to accelerate Q-learning.
The update complexity of previous online Q(lambda) implementations
based on lookup-tables is bounded by the size of the state/action
space. Our faster algorithm's update complexity is bounded by the
number of actions. The method is based on the observation that
Q-value updates may be postponed until they are needed.
Also to be presented at the 10th European Conference On Machine
Learning (ECML'98), Chemnitz (Germany), April 21-24 1998.
FTP-host: ftp.idsia.ch
FTP-files: /pub/marco/fast_q.ps.gz
/pub/marco/ecml_q.ps.gz
WWW: http://www.idsia.ch/~marco/publications.html
http://www.idsia.ch/~juergen/onlinepub.html
Marco & Juergen IDSIA, Switzerland
More information about the Connectionists
mailing list