Logo: University of Southern California

Events Calendar


  • PhD Dissertation Defense - Shushan Arakelyan

    Fri, Aug 23, 2024 @ 01:00 PM - 03:00 PM

    Thomas Lord Department of Computer Science

    University Calendar


    Title: 
    Building Generalizable Language Models for Code Processing
     
    Abstract:
    Successful deployment of any AI model requires generalization to previously unseen, real-world scenarios. Lack of generalization in models can lead to outcomes ranging from reduced performance to potential legal liabilities. In this thesis, I explore generalization challenges in large language models for code processing. I will discuss three different generalization concerns that language models for code processing can exhibit, and present my progress in building models that can overcome those. 1) I will start by discussing compositional generalization issues, where models must adapt to previously unseen instruction combinations 2) Next I will talk about out-of-domain generalization, and how distribution shifts within single projects or corporations can affect model performance, and how to overcome it. 3) Finally, I will talk about generalization of advanced models to programming languages with fewer resources.
     
    Venue: SAL 213
     
    Date/Time: August 23, 1pm-3pm
     
    Names of the Dissertation Defense Committee members: 
    Xiang Ren (chair), Morteza Dehghani, Aram Galstyan, Mukund Raghothaman
     

    Location: Henry Salvatori Computer Science Center (SAL) - 213

    Audiences: Everyone Is Invited

    Contact: Ellecia Williams

    Add to Google CalendarDownload ICS File for OutlookDownload iCal File

Return to Calendar