University of Southern California

Events Calendar


  • PhD Defense - Mrinal Kalakrishnan

    Tue, Nov 26, 2013 @ 12:00 PM - 02:00 PM

    Thomas Lord Department of Computer Science

    University Calendar


    PhD Candidate: Mrinal Kalakrishnan

    Committee members:

    Stefan Schaal (chair)
    Gaurav Sukhatme
    Francisco Valero-Cuevas (outside member)

    Time: Nov 26th 12:00pm
    Location: RTH 422

    Title: Learning objective functions for autonomous motion generation

    Abstract:

    Planning and optimization methods have been widely applied to the problem of trajectory generation for autonomous robotics. The performance of such methods, however, depends critically on the choice of objective function being optimized, which is non-trivial to design. On the other hand, efforts to learn autonomous behavior from user-provided demonstrations have largely focused on reproducing behavior similar in appearance to the demonstrations, which often fails to generalize well to new situations. An alternative approach, known as Inverse Reinforcement Learning (IRL), is to learn an objective function under which the demonstrations are assumed to be optimal. With the help of a planner or trajectory optimizer, such an approach allows the system to synthesize novel behavior in situations that were not experienced in the demonstrations.

    We present novel algorithms for IRL that have been successfully applied in two real-world, competitive robotics settings: (1) In the domain of rough-terrain quadruped locomotion, we present an algorithm that learns an objective function for foothold selection based on "terrain templates". The learner automatically generates and selects the features that form the objective function, reducing the need for feature engineering while attaining a high level of generalization. (2) In the domain of autonomous manipulation, we present a probabilistic model of optimal trajectories, which results in new algorithms for inverse reinforcement learning and trajectory optimization in high-dimensional settings. We apply this method to two problems in robotic manipulation: redundancy resolution in inverse kinematics, and trajectory optimization for grasping and manipulation.

    Both methods have proven themselves as part of larger integrated systems in competitions against other teams, where an independent test team evaluated them in situations that were not seen during training.

    Location: Ronald Tutor Hall of Engineering (RTH) - 422

    Audiences: Everyone Is Invited

    Contact: Lizsl De Leon

