USC - Viterbi School of Engineering

Ph.D. Thesis Proposal - Aniruddh G. Puranic
Wed, Jun 22, 2022 @ 12:00 PM - 02:00 PM
Thomas Lord Department of Computer Science

Candidate: Aniruddh G. Puranic

Thesis title: Learning from Demonstrations with Temporal Logics

Committee: Jyotirmoy V. Deshmukh, Stefanos Nikolaidis, Gaurav Sukhatme, Mukund Raghothaman, Somil Bansal, Julie Shah (MIT)

Date: June 22, 2022 (Wednesday)
Time: 12pm - 2pm Pacific Time
Location: SAL 213

Abstract:

Learning from demonstrations (LfD) is a popular paradigm for obtaining effective robot control policies for complex tasks via reinforcement learning, without the need to explicitly design reward functions. However, it is susceptible to imperfections in the demonstrations and raises concerns about the safety and interpretability of the learned control policies. To address these issues, we propose using Signal Temporal Logic (STL) to express high-level robotic tasks and using its quantitative semantics to evaluate and rank the quality of demonstrations. Temporal logic-based specifications allow us to create non-Markovian rewards and can also define causal dependencies between tasks, such as sequential task specifications. We present our completed work, the LfD-STL framework, which learns from even suboptimal or imperfect demonstrations together with STL specifications to infer rewards on which reinforcement learning can be performed to obtain control policies. Through numerous experiments, we have shown that our approach outperforms prior LfD methods.

We then propose further extensions to this framework to develop metrics that provide intuitive explanations of demonstrators' behaviors. Combined with the interpretability of the learned robot policies, these metrics can help in building a safe and trusted robotic system for human interaction. As a long-term goal, we plan to use such a metric as an optimization objective to potentially learn policies that perform better than the (imperfect) demonstrators.
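As a rough illustration of how STL quantitative semantics can score and rank demonstrations, the sketch below computes a robustness value for the specification "eventually reach the goal and always avoid the obstacle" over 2D trajectories. It is not the LfD-STL framework itself: the function names, the goal and obstacle positions, the radii, and the three example demonstrations are all hypothetical and chosen only for illustration.

```python
import math

# Illustrative sketch only; all names and values below are assumptions,
# not part of the LfD-STL framework described in the abstract.

def always_robustness(margins):
    """Robustness of G(margin > 0): the worst-case margin over the trajectory."""
    return min(margins)

def eventually_robustness(margins):
    """Robustness of F(margin > 0): the best-case margin over the trajectory."""
    return max(margins)

def demo_robustness(traj, goal, obstacle, goal_radius=0.5, obstacle_radius=1.0):
    """Robustness of F(reach goal) AND G(avoid obstacle) for one trajectory.

    Positive means the specification is satisfied; negative means it is violated.
    The magnitude indicates how robustly it is satisfied or violated.
    """
    reach = [goal_radius - math.dist(p, goal) for p in traj]
    avoid = [math.dist(p, obstacle) - obstacle_radius for p in traj]
    # Conjunction takes the minimum of the two sub-formula robustness values.
    return min(eventually_robustness(reach), always_robustness(avoid))

def rank_demonstrations(demos, goal, obstacle):
    """Return (name, robustness) pairs sorted from best to worst."""
    scored = [(name, demo_robustness(traj, goal, obstacle))
              for name, traj in demos.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)

# Hypothetical demonstrations: a wide detour, a close pass, and a collision.
GOAL, OBSTACLE = (4.0, 0.0), (2.0, 0.0)
DEMOS = {
    "unsafe": [(0, 0), (1, 0), (2, 0), (3, 0), (4, 0)],
    "risky": [(0, 0), (1, 0.5), (2, 1.1), (3, 0.5), (4, 0)],
    "safe": [(0, 0), (1, 1.5), (2, 2), (3, 1.5), (4, 0)],
}

ranking = rank_demonstrations(DEMOS, GOAL, OBSTACLE)
print([name for name, _ in ranking])  # ['safe', 'risky', 'unsafe']
```

In this toy setting the demonstration that collides with the obstacle receives a negative robustness and is ranked last, while the one that keeps the largest safety margin is ranked first. A ranking of this kind is the sort of signal that could be used to weight demonstrations when inferring rewards.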

Location: Henry Salvatori Computer Science Center (SAL) - 213
WebCast Link: https://usc.zoom.us/j/94560935551?pwd=ejY1UG1xTUZaQWJER1NOOUJNcGhQdz09
Audiences: Everyone Is Invited

Contact: Lizsl De Leon

This event is open to all eligible individuals. USC Viterbi operates all of its activities consistent with the University's Notice of Non-Discrimination. Eligibility is not determined based on race, sex, ethnicity, sexual orientation, or any other prohibited factor.