[AI Seminar] ai-seminar-announce Digest, Vol 72, Issue 6

Mon Mar 20 08:14:00 EDT 2017

A gentle reminder that the talk is tomorrow (Tuesday) noon in NSH 1507!

On Sat, Mar 18, 2017 at 12:00 PM, <ai-seminar-announce-request at cs.cmu.edu>
wrote:

> Send ai-seminar-announce mailing list submissions to
>         ai-seminar-announce at cs.cmu.edu
>
> To subscribe or unsubscribe via the World Wide Web, visit
>         https://mailman.srv.cs.cmu.edu/mailman/listinfo/ai-
> seminar-announce
> or, via email, send a message with subject or body 'help' to
>         ai-seminar-announce-request at cs.cmu.edu
>
> You can reach the person managing the list at
>         ai-seminar-announce-owner at cs.cmu.edu
>
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of ai-seminar-announce digest..."
>
>
> Today's Topics:
>
>    1.  AI Lunch -- Wen Sun -- March 21 (Unusual Room: NSH       1507)
>       (Adams Wei Yu)
>
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Fri, 17 Mar 2017 23:34:07 -0400
> From: Adams Wei Yu <weiyu at cs.cmu.edu>
> To: ai-seminar-announce at cs.cmu.edu
> Subject: [AI Seminar] AI Lunch -- Wen Sun -- March 21 (Unusual Room:
>         NSH     1507)
> Message-ID:
>         <CABzq7eq+SV=czupGEGXK_XVYmeNwvRgH9WnNessGpAn9BeT1nw@
> mail.gmail.com>
> Content-Type: text/plain; charset="utf-8"
>
> Dear faculty and students,
>
> We look forward to seeing you Next Tuesday, March 21, at noon in *NSH 1507*
> for AI lunch. To learn more about the seminar and lunch, please visit the
> AI
>  Lunch webpage <http://www.cs.cmu.edu/~aiseminar/>.
>
> On Tuesday, Wen Sun <http://www.cs.cmu.edu/~wensun/> will give a talk
> titled *Differentiable Imitation Learning and Sequential Prediction*.
>
> *Abstract*: Recently, researchers have demonstrated state-of-the-art
> performance on sequential decision making problems (e.g., robotics control,
> sequential prediction) with deep neural networks and Reinforcement Learning
> (RL). However, for some of these problems, oracles that can demonstrate
> good performance are available during training. In this work, we propose
> AggreVaTeD, a policy gradient extension of the Imitation Learning (IL)
> approach of Ross & Bagnell (2014) that can leverage oracles to achieve
> faster and more accurate solutions with less training data than with a
> less-informed RL approaches. Specifically, we provide a comprehensive
> theoretical study of IL that demonstrates we can expect up to exponentially
> lower sample complexity for learning with AggreVaTeD than with RL
> algorithms. Finally, we present two stochastic gradient procedures that
> learn neural network policies for several problems including a sequential
> prediction task as well as various high dimensional robotics control
> problems. Our results and theory indicate that the proposed approach can
> achieve superior performance with respect to the oracle when the
> demonstrator is sub-optimal.
>
> This a joint work with Arun Venkatraman, Geoff Gordon, Byron Boots and Drew
> Bagnell.
> -------------- next part --------------
> An HTML attachment was scrubbed...
> URL: <http://mailman.srv.cs.cmu.edu/pipermail/ai-seminar-
> announce/attachments/20170317/371f8226/attachment-0001.html>
>
> ------------------------------
>
> Subject: Digest Footer
>
> _______________________________________________
> ai-seminar-announce mailing list
> ai-seminar-announce at cs.cmu.edu
> https://mailman.srv.cs.cmu.edu/mailman/listinfo/ai-seminar-announce
>
> ------------------------------
>
> End of ai-seminar-announce Digest, Vol 72, Issue 6
> **************************************************
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.srv.cs.cmu.edu/pipermail/ai-seminar-announce/attachments/20170320/87208ef9/attachment.html>