BEGIN:VCALENDAR BEGIN:VEVENT SUMMARY:Ph.D. Thesis Proposal - Aniruddh G. Puranic DESCRIPTION:Candidate: Aniruddh G. Puranic\n \n Thesis title: Learning from Demonstrations with Temporal Logics\n \n Committee: Jyotirmoy V. Deshmukh, Stefanos Nikolaidis, Gaurav Sukhatme, Mukund Raghothaman, Somil Bansal, Julie Shah (MIT)\n \n Date: June 22, 2022 (Wednesday)\n Time: 12pm - 2pm Pacific Time\n Location: SAL 213\n \n Abstract:\n \n Learning-from-demonstrations (LfD) is a popular paradigm to obtain effective robot control policies for complex tasks via reinforcement learning without the need to explicitly design reward functions. However, it is susceptible to imperfections in demonstrations and raises concerns about the safety and interpretability of the learned control policies. To address these issues, we propose to use Signal Temporal Logic (STL) to express high-level robotic tasks and use its quantitative semantics to evaluate and rank the quality of demonstrations. Temporal logic-based specifications allow us to create non-Markovian rewards and are also capable of defining interesting causal dependencies between tasks such as sequential task specifications. We present our completed work on the LfD-STL framework, which uses STL specifications together with demonstrations, even suboptimal or imperfect ones, to infer rewards on which reinforcement learning can be performed to obtain control policies. Through numerous experiments, we have shown that our approach outperforms prior LfD methods.\n \n We then propose further extensions to this framework to develop metrics that provide intuitive explanations about demonstrators' behaviors, which, combined with the interpretability of the learned robot policies, can help build a safe and trusted robotic system for human interaction. 
As a long-term goal, we plan to use these metrics as an optimization objective for learning policies that can potentially outperform the (imperfect) demonstrators.\n DTSTART:20220622T120000 LOCATION:SAL 213 URL;VALUE=URI: DTEND:20220622T140000 END:VEVENT END:VCALENDAR