Thu, Dec 12, 2019 @ 11:00 AM - 12:00 PM
Information Sciences Institute
Conferences, Lectures, & Seminars
Speaker: Soravit Beer Changpinyo, Google AI
Talk Title: Tightly Connecting Vision and Language
Series: Natural Language Seminar
Abstract: Remarkable progress has been made at the intersection of vision and language. While showing great promise, current vision and language models do not function well in the wild. In this talk, I will present our recent efforts to bridge this gap for the tasks of image captioning and visual question answering. I will first describe several practical limitations of current benchmarks as yardsticks for grounded language understanding and visual reasoning. Then, I will describe our simple approach to transfer learning, where we leverage large-scale ultra-fine-grained data as a means to address the long tail of language. Finally, given these results, I will outline future directions and survey a variety of ongoing work along the lines of making vision and language research useful.
Biography: Soravit Changpinyo is a Software Engineer at Google AI. His research interests are in machine learning with applications to computer vision and natural language processing. Prior to joining Google, he was a PhD candidate and an Annenberg Fellow at the University of Southern California, advised by Fei Sha.
Host: Emily Sheng
More Info: https://nlg.isi.edu/nl-seminar
WebCast Link: https://bluejeans.com/s/unxRW/
Audiences: Everyone Is Invited
Contact: Peter Zamar