PhD Thesis Proposal - Qinyuan Ye
Mon, Apr 22, 2024 @ 10:00 AM - 11:30 AM
Thomas Lord Department of Computer Science
Title: Cross-Task Generalization Abilities of Large Language Models
Committee Members: Xiang Ren (Chair), Robin Jia, Swabha Swayamdipta, Jesse Thomason, Morteza Dehghani
Date & Time: Monday, April 22, 10am-11:30am
Location: SAL 213
Abstract: Humans can learn a new language task efficiently with only a few examples, by leveraging their knowledge and experience obtained when learning prior tasks. Enabling similar cross-task generalization abilities in NLP systems is fundamental for achieving the goal of general intelligence and enabling broader and more scalable adoption of language technology in future applications. In this thesis proposal, I will present my work on (1) benchmarking cross-task generalization abilities with diverse NLP tasks; (2) developing new model architectures for improving cross-task generalization abilities; and (3) analyzing and predicting the generalization landscape of current state-of-the-art large language models. Additionally, I will outline future research directions, along with preliminary thoughts on addressing them.
Zoom Link: https://usc.zoom.us/j/93269270403?pwd=NVNmN085bm5SWXNnNGErcXczeVkxdz09
Location: Henry Salvatori Computer Science Center (SAL) - 213
Audiences: Everyone Is Invited
Contact: Qinyuan Ye
Event Link: https://usc.zoom.us/j/93269270403?pwd=NVNmN085bm5SWXNnNGErcXczeVkxdz09