USC - Viterbi School of Engineering

CS Colloquium: Dani Yogatama (DeepMind) - Learning General Language Processing Agents
Tue, Mar 09, 2021 @ 09:00 AM - 10:00 AM
Thomas Lord Department of Computer Science
Conferences, Lectures, & Seminars

Speaker: Dani Yogatama, DeepMind

Talk Title: Learning General Language Processing Agents

Series: CS Colloquium

Abstract: The ability to continuously learn and generalize to new problems quickly is a hallmark of general intelligence. Existing machine learning models work well when optimized for a particular benchmark, but they require many in-domain training examples (i.e., input-output pairs that are often costly to annotate), overfit to the idiosyncrasies of the benchmark, and do not generalize to out-of-domain examples. In contrast, humans are able to accumulate task-agnostic knowledge from multiple modalities to facilitate faster learning of new skills.

In this talk, I will argue that obtaining such an ability for a language model requires significant advances in how we acquire, represent, and store knowledge in artificial systems. I will present two approaches in this direction: (i) an information-theoretic framework that unifies several representation learning methods used in many domains (e.g., natural language processing, computer vision, audio processing) and allows principled construction of new training objectives to learn better language representations; and (ii) a language model architecture that separates computation (information processing) in a large neural network from memory storage in a key-value database. I will conclude by briefly discussing a series of future research programs toward building a general linguistically intelligent agent.

This lecture satisfies requirements for CSCI 591: Research Colloquium.

Biography: Dani Yogatama is a staff research scientist at DeepMind. His research interests are in machine learning and natural language processing. He received his PhD from Carnegie Mellon University in 2015. He grew up in Indonesia and was a Monbukagakusho scholar in Japan prior to studying at CMU.

Host: Xiang Ren

Audiences: By invitation only.

Contact: Assistant to CS chair

This event is open to all eligible individuals. USC Viterbi operates all of its activities consistent with the University's Notice of Non-Discrimination. Eligibility is not determined based on race, sex, ethnicity, sexual orientation, or any other prohibited factor.