three technical reports related to reinforcement learning
Ron Sun
rsun at research.nj.nec.com
Wed Apr 7 13:16:36 EDT 1999
Announcing three technical reports
(concerning enhancing reinforcement learners, either in terms of
improving their learning processes by dividing up the space or
sequence, or in terms of knowledge extraction from outcomes of
reinforcement learning)
---------------------------------
Learning Plans without a priori Knowledge
by Ron Sun and Chad Sessions
http://cs.ua.edu/~rsun/sun.plan.ps
ABSTRACT
This paper is concerned with autonomous learning of plans in
probabilistic domains without a priori domain-specific knowledge.
Different from existing reinforcement learning algorithms that
generate only reactive plans and existing probabilistic planning
algorithms that require a substantial amount of a priori knowledge
in order to plan, a two-stage bottom-up process is devised,
in which first reinforcement learning is
applied, without the use of a priori domain-specific knowledge,
to acquire a reactive plan and then explicit plans are extracted
from the reactive plan. Several options in plan extraction are
examined, each of which is based on beam search that performs
temporal projection in a restricted fashion, guided by the
value functions resulting from reinforcement learning.
Some completeness and soundness results are given. Examples in several
domains are discussed that together demonstrate the working of
the proposed model.
A shortened version appeared in:
Proc. 1998 International Symposium on Intelligent Data Engineering
and Learning, October, 1998. Springer-Verlag.
---------------------------------
Multi-Agent Reinforcement Learning: Weighting and Partitioning
by Ron Sun and Todd Peterson
http://cs.ua.edu/~rsun/sun.NN99.ps
ABSTRACT:
This paper addresses weighting and partitioning,
in complex reinforcement learning tasks, with the aim
of facilitating learning. The paper presents some
ideas regarding weighting of multiple agents
and extends them into partitioning an input/state space into
multiple regions with differential weighting in these regions,
to exploit differential characteristics of regions and
differential characteristics of agents to
reduce the learning complexity of agents (and their
function approximators) and thus to facilitate the learning
overall. It analyzes, in reinforcement learning tasks,
different ways of partitioning a task
and using agents selectively based on partitioning.
Based on the analysis, some heuristic methods are described
and experimentally tested. We find that some off-line
heuristic methods performed the best,
significantly better than single-agent models.
To appear in: Neural Networks, in press.
A shortened version appeared in Proc. of IJCNN'99
---------------------------------
Self-Segmentation of Sequences:
Automatic Formation of Hierarchies of Sequential Behaviors
by Ron Sun and Chad Sessions
http://cs.ua.edu/~rsun/sun.sss.ps
ABSTRACT
The paper presents an approach for hierarchical reinforcement
learning that does not rely on a priori domain-specific
knowledge regarding hierarchical structures. Thus this work deals
with a more difficult problem compared with existing work. It
involves learning to segment action sequences to create hierarchical
structures, based on reinforcement received during task execution,
with different levels of control communicating with each other
through sharing reinforcement estimates obtained by each other.
The algorithm segments action sequences to reduce non-Markovian
temporal dependencies, and seeks out proper configurations of
long- and short-range dependencies, to facilitate the learning
of the overall task. Developing hierarchies
also facilitates the extraction of explicit hierarchical plans.
The initial experiments demonstrate the promise of the approach.
A shortened version of this report appeared in Proc. IJCNN'99.
Washington, DC.
---------------------------------
Dr. Ron Sun
NEC Research Institute
4 Independence Way
Princeton, NJ 08540
phone: 609-520-1550
fax: 609-951-2483
email: rsun at research.nj.nec.com
(July 1st, 1998 -- July 1st, 1999)
-----------------------------------------
Prof. Ron Sun http://cs.ua.edu/~rsun
Department of Computer Science
and Department of Psychology phone: (205) 348-6363
The University of Alabama fax: (205) 348-0219
Tuscaloosa, AL 35487 email: rsun at cs.ua.edu
More information about the Connectionists
mailing list