Logo: University of Southern California

Events Calendar



Select a calendar:



Filter April Events by Event Type:



Student Activity
Events for April

  • PhD Defense- Tiancheng Jin

    Mon, Apr 22, 2024 @ 04:00 PM - 05:30 PM

    Thomas Lord Department of Computer Science

    Student Activity


    PhD Defense- Tiancheng Jin
    Title: Robust and Adaptive Online Reinforcement Learning 
    Committee: Haipeng Luo (Chair), Rahul Jain, Vatsal Sharron
     
    Abstract: Reinforcement learning (RL) is a machine learning (ML) technique on learning to make optimal sequential decisions via interactions with an environment. In recent years, RL achieved great success in many artificial intelligence tasks, and has been widely regarded as one of the keys towards Artificial General Intelligence (AGI). However, most RL models are trained on simulators, and suffer from the reality gap: a mismatch between simulated and real-world performance. Moreover, recent work has shown that RL models are especially vulnerable to adversarial attacks. This motivates the research on improving the robustness of RL, that is, the ability of ensuring worst-case guarantees.

    On the other hand, it is not favorable to be too conservative/pessimistic and sacrifice too much performance while the environment is not difficult to deal with.In other words, adaptivity --- the capability of automatically adapting to the maliciousness of the environment, is especially desirable to RL algorithms: they should not only target worst-case guarantee, but also pursue instance optimality and achieve better performance against benign environments.
    In this thesis, we focus on designing practical, robust and adaptive reinforcement algorithms.

    Specifically, we take inspiration from the online learning literature, and consider interacting with a sequence of Markov Decision Processes (MDPs), which captures the nature of changing environment. We hope that the techniques and insight developed in this thesis could shed light on improving existing deep RL algorithms for future applications.

    Location: Kaprielian Hall (KAP) - 141

    Audiences: Everyone Is Invited

    Contact: Tiancheng Jin

    OutlookiCal