RL papers available by ftp
Rich Sutton
sutton at gte.com
Sun Feb 19 13:03:59 EST 1995
The following previously published papers related to reinforcement learning
are available online for the first time:
Sutton, R.S. (1988) "Learning to predict by the methods of temporal
differences," Machine Learning, 3, 1988, No. 1, pp. 9--44.
Sutton, R.S. (1990) "Integrated architectures for learning, planning, and
reacting based on approximating dynamic programming," Proceedings of the
Seventh International Conference on Machine Learning, pp. 216--224,
Morgan Kaufmann.
Sutton, R.S. (1991a) "Planning by incremental dynamic programming,"
Proceedings of the Eighth International Workshop on Machine Learning,
pp. 353-357, Morgan Kaufmann.
Sutton, R.S. (1991b) "Dyna, an integrated architecture for learning,
planning and reacting," Working Notes of the 1991 AAAI Spring Symposium
on Integrated Intelligent Architectures} and SIGART Bulletin 2, pp. 160-163.
Sutton, R.S. (1992a) "Adapting Bias by Gradient Descent: An Incremental
Version of Delta-Bar-Delta," Proceedings of the Tenth National Conference
on Artificial Intelligence, pp. 171-176, MIT Press.
Sutton, R.S., Whitehead, S.D. (1993) "Online learning with random
representations." Proceedings of the Tenth Annual
Conference on Machine Learning, pp. 314-321, Morgan Kaufmann.
These papers can be obtained by ftp from the small archive at
ftp.gte.com/reinforcement-learning. See the file CATALOG for filenames and
abstracts.
More information about the Connectionists
mailing list