paper available: incremental self-improvement
Juergen Schmidhuber
schmidhu at informatik.tu-muenchen.de
Wed Feb 1 06:16:36 EST 1995
ON LEARNING HOW TO LEARN LEARNING STRATEGIES
Technical Report FKI-198-94 (20 pages)
Juergen Schmidhuber
Fakultaet fuer Informatik
Technische Universitaet Muenchen
80290 Muenchen, Germany
November 24, 1994
Revised January 31, 1995
This paper introduces the ``incremental self-improvement paradigm''.
Unlike previous methods, incremental self-improvement encourages a
reinforcement learning system to improve the way it learns, and to
improve the way it improves the way it learns, without significant
theoretical limitations -- the system is able to ``shift its induc-
tive bias'' in a universal way. Its major features are: (1) There
is no explicit difference between ``learning'', ``meta-learning'',
and other kinds of information processing. Using a Turing machine
equivalent programming language, the system itself occasionally
executes self-delimiting, initially highly random ``self-modifi-
cation programs'' which modify the context-dependent probabilities
of future programs (including future self-modification programs).
(2) The system keeps only those probability modifications computed
by ``useful'' self-modification programs: those which bring about
more payoff per time than all previous self-modification programs.
(3) The computation of payoff per time takes into account all the
computation time required for learning -- the entire system life is
considered: boundaries between learning trials are ignored (if there
are any). A particular implementation based on the novel paradigm is
presented. It is designed to exploit what conventional digital
machines are good at: fast storage addressing, arithmetic operations
etc. Experiments illustrate the system's mode of operation.
-------------------------------------------------------------------
FTP-host: flop.informatik.tu-muenchen.de (131.159.8.35)
FTP-filename: /pub/fki/fki-198-94.ps.gz (use gunzip to uncompress)
Alternatively, retrieve the paper from my home page:
http://papa.informatik.tu-muenchen.de/mitarbeiter/schmidhu.html
If you don't have gzip/gunzip, I can mail you an uncompressed
postscript version (as a last resort). There will be a future
revised version of the tech report. Comments are welcome.
Juergen Schmidhuber
More information about the Connectionists
mailing list