Logo: University of Southern California

Events Calendar


  • Insights on Latent Perceptual Indexing with Applications in Audio and Speech Recognition

    Thu, Dec 09, 2010 @ 10:00 AM - 12:00 PM

    Ming Hsieh Department of Electrical and Computer Engineering

    Conferences, Lectures, & Seminars


    Speaker: Shiva Sundaram, Senior Research Scientist/ Deutsche Telekom Laboratories (T-Labs), Berlin, Germany

    Talk Title: Insights on Latent Perceptual Indexing with Applications in Audio and Speech Recognition

    Abstract: One of the main ideas that originated from my thesis work is latent indexing applied to content-based audio retrieval. Coined as Latent Perceptual Indexing/Mapping, it fundamentally uses the information in weighted unit-document co-occurrence measures. The procedure is analogous to latent semantic indexing of text documents except the bag-of-features from the audio clips constitute the documents and the units are obtained by clustering those documents. In this talk, I will present improvements to the basic approach and also present recent results on its application to acoustic modelling for speech recognition. I will also take this opportunity to talk about my related research efforts in affect-based retrieval of audio, salient-event detection in video and natural speech interfaces.

    Biography: Shiva Sundaram received his PhD and his MS, both in Electrical Engineering from the University of Southern California (USC) in 2008, and 2003 respectively. He received his Bachelor of Engineering (B.E) degree in Electronics Engineering from the University of Pune, India in 2001. Since November 2008 he has been a Senior Research Scientist with Deutsche Telekom Laboratories (T-Labs) in Berlin, Germany. Before joining T-Labs, he was a research intern in the Speech and Language Technologies Group at Apple. From summer 2002 to fall 2008 he was a research assistant with Prof. Shrikanth Narayanan in the Signal Analysis and Interpretation Lab (SAIL) at the University of Southern California (USC), Los Angeles. His research interests in the area of speech and audio processing includes recognition and synthesis of speech, signal processing for multimedia retrieval, audio perception, and pattern recognition. He has published over 25 scientific articles in international conferences and journals. In 2006, he received the best student paper award in IEEE MMSP workshop for his work in music information retrieval.

    Host: Professor Shrikanth Narayanan

    Location: Hughes Aircraft Electrical Engineering Center (EEB) - 248

    Audiences: Everyone Is Invited

    Contact: Mary Francis

    Add to Google CalendarDownload ICS File for OutlookDownload iCal File

Return to Calendar