USC - Viterbi School of Engineering

Oct
06

Seminar Series: Reinforcement Learning and Markov Chain Computations
Tue, Oct 06, 2009 @ 02:00 PM - 03:00 PM
Ming Hsieh Department of Electrical and Computer Engineering
Conferences, Lectures, & Seminars

Prof. Vivek Borkar, Tata Institute of Fundamental Research (TIFR), Mumbai, India This two part series shall cover an introduction to reinforcement learning and stochastic approximations, and its application to Markov Chain computations.Part I (Tuesday, Oct. 6) shall highlight the main strands in the reinforcement learning based approaches to approximate dynamic programming for Markov decision processes. In particular, connections to numerical methods for MDPs and convergence issues will be discussed.Speaker Bio:
Vivek Borkar is a Professor in the School of Technology and Computer Science at the Tata Institute of Fundamental Research (TIFR), Mumbai where he has been for the last decade.
He was formerly Dean of the same school. Prior to TIFR, he was a Professor in the Computer Science and Automation department of the Indian Institute of Science, Bangalore. He received his Ph.D. from University of California, Berkeley in EECS in 1979. He is well-known for his work in many areas including stochastic processes, mathematical control, game theory and learning. He is the author of several books including a recent book on Stochastic approximations: A Dynamical Systems Viewpoint.Host: Prof. Rahul Jain, 213-740-2246, rahul.jain@usc.edu. If you would like to meet the speaker during his weeklong visit from October 5-9, please contact the host.
Location: Hedco Petroleum and Chemical Engineering Building (HED) - 116
Audiences: Everyone Is Invited

Contact: Annie Yu

This event is open to all eligible individuals. USC Viterbi operates all of its activities consistent with the University's Notice of Non-Discrimination. Eligibility is not determined based on race, sex, ethnicity, sexual orientation, or any other prohibited factor.
Add to Google Calendar

Return to Calendar

Events Calendar

Seminar Series: Reinforcement Learning and Markov Chain Computations