Learning Team Strategies: Soccer Case Studies

Rafal Salustowicz rafal at idsia.ch
Tue Dec 16 11:00:07 EST 1997



            LEARNING TEAM STRATEGIES: SOCCER CASE STUDIES

Rafal Salustowicz           Marco Wiering          Juergen Schmidhuber

                    IDSIA, Lugano (Switzerland)

                 Revised Technical Report IDSIA-29-97
           To appear in the Machine Learning Journal (1998)

We use simulated soccer   to  study multiagent learning.  Each  team's
players   (agents)  share  action  set and    policy,  but  may behave
differently due to position-dependent  inputs.  All agents making up a
team are  rewarded  or punished collectively  in   case of goals.   We
conduct  simulations  with varying  team   sizes, and compare  several
learning algorithms: TD-Q learning with linear neural networks (TD-Q),
Probabilistic Incremental Program Evolution (PIPE), and a PIPE version
that learns   by coevolution (CO-PIPE).    TD-Q is  based on  learning
evaluation functions  (EFs)  mapping  input/action  pairs  to expected
reward.  PIPE  and CO-PIPE  search  policy  space directly.  They  use
adaptive    probability distributions    to  synthesize programs  that
calculate action  probabilities from current  inputs. Our results show
that   linear  TD-Q  encounters    several  difficulties in   learning
appropriate shared EFs. PIPE and  CO-PIPE, however,  do not depend  on
EFs  and  find good policies  faster  and more reliably. This suggests
that in some  multiagent learning scenarios   direct search in  policy
space can offer advantages over EF-based approaches.

                              http://www.idsia.ch/~rafal/research.html
                             ftp://ftp.idsia.ch/pub/rafal/soccer.ps.gz

Related papers by the same authors:

Evolving soccer strategies. In N. Kasabov, R. Kozma, K. Ko, R. O'Shea,
G. Coghill,  and T. Gedeon,  editors,  Progress in Connectionist-based
Information Systems:Proc. of the 4th Intl. Conf. on Neural Information
Processing ICONIP'97, pages 502-505, Springer-Verlag, Singapore, 1997.
		      ftp://ftp.idsia.ch/pub/rafal/ICONIP_soccer.ps.gz

On learning soccer strategies.  In W. Gerstner, A. Germond, M. Hasler,
and J.-D.  Nicoud, editors, Proc. of the 7th Intl. Conf. on Artificial
Neural Networks (ICANN'97),  volume 1327 of Lecture Notes in  Computer
Science,  pages 769-774, Springer-Verlag Berlin Heidelberg, 1997.
                       ftp://ftp.idsia.ch/pub/rafal/ICANN_soccer.ps.gz

**********************************************************************
                        Rafal Salustowicz
 Istituto Dalle Molle di Studi sull'Intelligenza Artificiale (IDSIA)
 Corso Elvezia 36                   e-mail: rafal at idsia.ch
 6900 Lugano                                ==============
 Switzerland                                raf at cs.tu-berlin.de
 Tel (IDSIA) : +41 91 91198-38              raf at psych.stanford.edu
 Tel (office): +41 91 91198-32
 Fax         : +41 91 91198-39         WWW: http://www.idsia.ch/~rafal
**********************************************************************




More information about the Connectionists mailing list