Tue, Apr 25, 2023 @ 12:00 PM - 02:00 PM
Thomas Lord Department of Computer Science
PHD Thesis Proposal - Woojeong Jin
Title: Towards a Better Reasoner on Visual Information
Humans acquire knowledge by processing visual information through observation and imagination, which expands our reasoning capability about the physical world we encounter every day. Despite significant progress in solving AI problems, current state-of-the-art models in natural language processing (NLP) and computer vision (CV) have limitations in terms of reasoning and generalization, particularly with complex reasoning on visual information and generalizing to unseen vision-language tasks. This thesis proposal aims to address these shortcomings by presenting a series of works that enable smaller vision-language (VL) models to generalize to new tasks, improve language models by incorporating visual information, and evaluate language models by assessing their ability to reason about the physical world through text.
12 pm on 4/25
Committee Members: Xiang Ren, Ram Nevatia, Jesse Thomason, Robin Jia, Emilio Ferrara.
Audiences: Everyone Is Invited
Contact: Asiroh Cham
Event Link: https://usc.zoom.us/j/98941948220