USC - Viterbi School of Engineering

Dec
09

World models beyond autoregressive next state prediction
Mon, Dec 09, 2024 @ 03:00 PM - 04:00 PM
Ming Hsieh Department of Electrical and Computer Engineering, Thomas Lord Department of Computer Science, USC School of Advanced Computing
Conferences, Lectures, & Seminars

Speaker: Abhishek Gupta, Ph.D., Assistant Professor of Computer Science and Engineering, Paul G. Allen School at the University of Washington

Talk Title: World models beyond autoregressive next state prediction

Series: CSC@USC/CommNetS-MHI Seminar Series

Abstract: Learned models of system dynamics provide an appealing way of predicting the future outcomes in a system, enabling downstream usage for planning or off-policy evaluation in applications such as robotics. However, the prevalent paradigm of autoregressive, next-state prediction in learning dynamics models is challenging to scale to environments with high dimensional observations and long horizons. In this talk, I will present alternative techniques for model learning that go beyond directly predicting next states. Firstly, we will discuss a reconstruction-free class of models that go beyond next-observation prediction by learning the evolution of task-directed latent representations for high dimensional observation spaces. We will then show how this can be generalized to learning a new class of models that avoid autoregressive prediction altogether by directly modeling long-term cumulative outcomes, while remaining task agnostic. In doing so, this talk will propose alternative ways of thinking about model learning that retain the benefits of transferability and efficiency from model-based RL, while going beyond next-state prediction.

Biography: Abhishek Gupta is an assistant professor of computer science and engineering at the Paul G. Allen School at the University of Washington. Prior to joining University of Washington, he was a post-doctoral scholar at MIT, collaborating with Russ Tedrake and Pulkit Agarwal. He completed his Ph.D. at UC Berkeley working with Pieter Abbeel and Sergey Levine, building systems that can leverage reinforcement learning algorithms to solve robotics problems. He is interested in research directions that enable directly performing reinforcement learning directly in the real world — reward supervision in reinforcement learning, large scale real world data collection, learning from demonstrations, and multi-task reinforcement learning. He has also spent time at Google Brain. He is a recipient of the NDSEG and NSF graduate research fellowships, and several of his works have been presented as spotlight presentations at top-tier machine learning and robotics conferences.

Host: Erdem Biyik

Location: Hughes Aircraft Electrical Engineering Center (EEB) - 248
Audiences: Everyone Is Invited

Contact: Erdem Biyik

This event is open to all eligible individuals. USC Viterbi operates all of its activities consistent with the University's Notice of Non-Discrimination. Eligibility is not determined based on race, sex, ethnicity, sexual orientation, or any other prohibited factor.
Add to Google Calendar

Return to Calendar

Events Calendar

World models beyond autoregressive next state prediction