Connectionists: Call for papers: NIPS-08 Workshop on Model Uncertainty and Risk in Reinforcement Learning

Pascal Poupart ppoupart at cs.uwaterloo.ca
Thu Sep 18 13:23:33 EDT 2008


Call for papers
NIPS-08 Workshop on Model Uncertainty and Risk in Reinforcement Learning
http://www.cs.uwaterloo.ca/~ppoupart/nips08-workshop.html
Whistler, BC, Canada
December 13, 2008


Important Dates
---------------
* Oct 30: submission deadline
* Nov 4: notification of acceptance


Overview
--------

Reinforcement Learning (RL) problems are typically formulated in terms of 
Stochastic Decision Processes (SDPs), or a specialization thereof, 
Markovian Decision Processes (MDPs), with the goal of identifying an 
optimal control policy. In contrast to planning problems, RL problems are 
characterized by the lack of complete information concerning the 
transition and reward models of the SDP. Hence, algorithms for solving RL 
problems need to estimate properties of the system from finite data. 
Naturally, any such estimated quantity has inherent uncertainty. One of 
the interesting and challenging aspects of RL is that the algorithms have 
partial control over the data sample they observe, allowing them to 
actively control the amount of this uncertainty, and potentially trade it 
off against performance.

Reinforcement Learning as a field of research, has over the past few years 
seen renewed interest in methods that explicitly consider the 
uncertainties inherent to the learning process. Indeed, interest in 
data-driven models that take uncertainties into account, goes beyond RL to 
the fields of Control Theory, Operations Research and Statistics. Within 
the RL community, relevant lines of research may be classified into the 
following (partially overlapping) sub-fields:

1- Bayesian RL. Bayesian methods attempt to explicitly model uncertainties 
using posterior probability distributions, computed using Bayes' rule. 
Such Bayesian modeling may be used in estimating the MDP's transition and 
reward distributions; or in estimating other quantities that are more 
directly related to performance, such as value function and policy 
gradient.

2- Risk sensitive and robust dynamic decision making. These methods use 
information beyond the expected return, to compute policies that are 
robust to inaccuracies in the estimated model. Such quantities include 
quantiles, as well as higher order moments of the return random variable. 
A closely related family of methods use expectations of non-linear 
mappings of the return, as their measures of performance.

3- RL with confidence intervals. This research is concerned with methods 
that employ Frequentist measures of model uncertainties, based on 
confidence intervals. Much of this research is focused on on-line 
algorithms, whose performance is evaluated concurrently with the learning 
process.

4- Applications of risk-aware and uncertainty-aware decision-making. 
Applications in mission critical tasks, finance, and other risk-sensitive 
domains, where uncertainties have to be taken into account, in order to 
establish a level of worst-case performance, or to guarantee a minimum 
level of performance that may be achieved with high probability.

This workshop is aimed at bringing together researchers working in these 
and related fields, allow them to present their current research, and 
discuss possible directions for future work. We intend to focus on 
possible interactions between the sub-fields listed above, as well as on 
interactions with other related fields, which are outside of the current 
RL mainstream.


Workshop format
---------------

This is a one-day workshop consisting of:

1- Invited talks

2- Contributed talks

3- Panel discussions

3.1- Models that work and those that don't: participants will discuss 
specific applications and theoretical models and share experience regarding 
the effectiveness of different approaches. 

3.2- Benchmarks and challenges: discussion of some proposals for sample 
problems that encompass the core challenges of model uncertainty and 
risk sensitive control that could serve as benchmarks and/or challenges.

4- Poster session


Call for Contributions
----------------------

Participants are invited to submit either a technical paper (eight pages 
in the conference format) or an extended abstract (up to two pages) 
describing research relevant to the workshop. Submissions should be sent 
via email to Pascal Poupart at ppoupart at cs.uwaterloo.ca by Oct 30th 
in Postscript, PDF, or MS Word format. Previously published work that is 
reworded, summarized or extended may be submitted to the workshop.  
However, priority will be given to novel work.  If the papers are of 
sufficient quantity and quality, we will seek to publish them as an 
edited book or journal special issue.


Important Dates
---------------

Oct 30: submission deadline
Nov 4: notification of acceptance
Dec 13: workshop in Whistler


Workshop webpage
----------------

http://www.cs.uwaterloo.ca/~ppoupart/nips08-workshop.html


Organizing Committee
--------------------

1- Yaakov Engel (yakiengel at gmail.com)
2- Mohammad Ghavamzadeh (INRIA - Team SequeL, mgh at cs.ualberta.ca),
3- Shie Mannor (McGill University, shie at ece.mcgill.ca)
4- Pascal Poupart (University of Waterloo, ppoupart at cs.uwaterloo.ca)


-- 
------------------------
Pascal Poupart
Assistant Professor
David R. Cheriton School of Computer Science
University of Waterloo
200 University Avenue West
Waterloo, Ontario
Canada N2L 3G1
------------------------
Web: http://www.cs.uwaterloo.ca/~ppoupart
Email: ppoupart at cs.uwaterloo.ca 
Telephone: 1-519-888-4567x36239 
Fax: 1-519-885-1208
------------------------




More information about the Connectionists mailing list