<div dir="ltr">Quick reminder that this talk is happening later today.</div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Wed, Apr 24, 2024 at 11:59 AM Victor Akinwande &lt;<a href="mailto:vakinwan@andrew.cmu.edu">vakinwan@andrew.cmu.edu</a>&gt; wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr">Dear all,<br><br>We look forward to seeing you <span style="background-color:rgb(255,255,0)">next Tuesday (04/30) from 12:00-1:00 PM (ET) </span>for the next talk of this semester's CMU AI Seminar, sponsored by SambaNova Systems (<a href="https://sambanova.ai" target="_blank">https://sambanova.ai</a>). The seminar will be held in NSH 3305 with pizza provided and will be streamed on Zoom.<br><br>To learn more about the seminar series or to see the future schedule, please visit the seminar website (<a href="http://www.cs.cmu.edu/~aiseminar/" target="_blank">http://www.cs.cmu.edu/~aiseminar/</a>).<br><br><span style="background-color:rgb(255,255,0)">Next Tuesday (04/30), Olatunji (Tunji) Ruwase (Microsoft Research) will be giving a virtual talk titled "DeepSpeed: Enabling efficient trillion parameter scale training for deep learning models".</span><br><br>///////////////////<div><br><b>Talk Abstract: </b>Deep Learning is driving unprecedented progress in a wide range of Artificial Intelligence domains, including natural language processing, vision, speech, and multimodal. Sustaining this rapid pace of AI revolution, however, requires practical solutions to the extreme demands of model scaling on the compute, memory, communication and storage components of modern computing hardware. To address this challenge, we created a deep learning optimization library called DeepSpeed to make distributed model training and inference efficient, effective, and easy on commodity hardware. 
This talk will focus on DeepSpeed optimizations for improving memory, compute, communication, and data efficiency of extreme-scale model training.<br><br><b>Speaker Bio: </b>Olatunji (Tunji) Ruwase is the lead and co-founder of the DeepSpeed project at Microsoft. His broad industry and research background spans compilers, operating systems, and hardware accelerators. His current focus is on systems and convergence optimizations, and frameworks for efficient distributed training and inference of deep learning models. His research results on deep learning training, inference, and hyperparameter search are used in multiple Microsoft systems and products, such as Azure, Ads, Bing, Catapult, and HyperDrive. Tunji earned a PhD in Computer Science from Carnegie Mellon University under the guidance of Professor Todd Mowry.<br><br><div>///////////////////<br><br><b>In person: NSH 3305<br>Zoom Link: <a href="https://cmu.zoom.us/j/99510233317?pwd=ZGx4aExNZ1FNaGY4SHI3Qlh0YjNWUT09" target="_blank">https://cmu.zoom.us/j/99510233317?pwd=ZGx4aExNZ1FNaGY4SHI3Qlh0YjNWUT09</a></b><br><div><br></div><div>- Victor &amp; Asher</div></div></div></div>
</blockquote></div>