USC - Viterbi School of Engineering

PhD Defense - Mrinal Kalakrishnan
Tue, Nov 26, 2013 @ 12:00 PM - 02:00 PM
Thomas Lord Department of Computer Science

PhD Candidate: Mrinal Kalakrishnan

Committee members:

Stefan Schaal (chair)
Gaurav Sukhatme
Francisco Valero-Cuevas (outside member)

Time: Nov 26th 12:00pm
Location: RTH 422

Title: Learning objective functions for autonomous motion generation

Abstract:

Planning and optimization methods have been widely applied to the problem of trajectory generation for autonomous robots. The performance of such methods, however, depends critically on the choice of objective function being optimized, which is non-trivial to design. On the other hand, efforts to learn autonomous behavior from user-provided demonstrations have largely focused on reproducing behavior similar in appearance to the demonstrations, which often fails to generalize well to new situations. An alternative approach, known as Inverse Reinforcement Learning (IRL), is to learn an objective function under which the demonstrations are assumed to be optimal. With the help of a planner or trajectory optimizer, such an approach allows the system to synthesize novel behavior in situations that were not experienced in the demonstrations.

We present novel algorithms for IRL that have been successfully applied in two real-world, competitive robotics settings: (1) In the domain of rough terrain quadruped locomotion, we present an algorithm that learns an objective function for foothold selection based on "terrain templates". The learner automatically generates and selects the features that form the objective function, reducing the need for feature engineering while attaining a high level of generalization. (2) In the domain of autonomous manipulation, we present a probabilistic model of optimal trajectories, which results in new algorithms for inverse reinforcement learning and trajectory optimization in high-dimensional settings. We apply this method to two problems in robotic manipulation: redundancy resolution in inverse kinematics, and trajectory optimization for grasping and manipulation.

Both methods have proven themselves as part of larger integrated systems in competitive settings against other teams, where testing was conducted by an independent test team in situations that were not seen during training.

Location: Ronald Tutor Hall of Engineering (RTH) - 422
Audiences: Everyone Is Invited

Contact: Lizsl De Leon

This event is open to all eligible individuals. USC Viterbi operates all of its activities consistent with the University's Notice of Non-Discrimination. Eligibility is not determined based on race, sex, ethnicity, sexual orientation, or any other prohibited factor.