USC - Viterbi School of Engineering

Feb
22

On Representing Acoustics of Speech for Speech Processing
Tue, Feb 22, 2011 @ 10:00 AM - 12:00 PM
Ming Hsieh Department of Electrical and Computer Engineering
Conferences, Lectures, & Seminars

Speaker: Bishnu S. Atal, Dept. of Electrical Engineering, Univ. of Washington, Seattle, WA

Talk Title: On Representing Acoustics of Speech for Speech Processing

Abstract: Most methods for analyzing speech start by transforming the acoustic time-domain signal into spectral form. The short-time Fourier transform provides a representation of the time-varying characteristics of the signal and has a long history. There are many issues, such as the size and shape of the window, that remain unresolved. The use of a relatively short window is widespread. In early development of the sound spectrograph, use of both narrow and wideband analysis was quite common, but the narrow-band analysis faded away. In digital speech coding applications (multipulse and code-excited linear prediction), high-quality speech is produced at low bit rates only when prediction using both short and long intervals is used. What are the issues that arise in using a short or a long window? What are the relative advantages or disadvantages? In this talk, we will discuss these topics and present results that suggest that a short-time Fourier transform using long windows has advantages. In most speech representations, the Fourier components are not used directly but converted to their magnitude spectrum; the so-called phase is considered to be irrelevant. There are open questions regarding the use of phase information and we will discuss this important issue in the talk.

Biography: Bishnu S. Atal is an Affiliate Professor in the Electrical Engineering Department at the University of Washington, Seattle, WA. He retired in March 2002 after working for more than 40 years at Lucent Bell Labs, and AT&T Labs. He was a Technical Director at the AT&T Shannon Laboratory, Florham Park, New Jersey, from 1997 where he was engaged in research in speech coding and in automatic speech recognition. He joined the technical staff of AT&T Bell Laboratories in 1961, became head of Acoustics Research Department in 1985, and head of Speech Research Department in 1990.
He is internationally recognized for his many contributions to speech analysis, synthesis, and coding. His pioneering work in linear predictive coding of speech established linear prediction as one of the most important speech analysis technique leading to many applications in coding, recognition and synthesis of speech. His research work is documented in over 90 technical papers and he holds 17 U.S. and numerous international patents in speech processing.
He was elected to the National Academy of Engineering in 1987 and to the National Academy of Sciences in 1993. He is a Fellow of the Acoustical Society of America and the IEEE. He received the IEEE Morris N. Liebmann Memorial Field Award in 1986, the Thomas Edison Patent Award from the R&D Council of New Jersey in 1994, New Jersey Inventors Hall of Fame Inventor of the Year Award in 2000 and the Benjamin Franklin Medal in Electrical Engineering in 2003.
Bishnu lives in Mukilteo, Washington.

Host: Sanjit Mitra and Shrikanth Narayanan

Location: Hughes Aircraft Electrical Engineering Center (EEB) - 248
Audiences: Everyone Is Invited

Contact: Mary Francis

This event is open to all eligible individuals. USC Viterbi operates all of its activities consistent with the University's Notice of Non-Discrimination. Eligibility is not determined based on race, sex, ethnicity, sexual orientation, or any other prohibited factor.
Add to Google Calendar

Return to Calendar

Events Calendar

On Representing Acoustics of Speech for Speech Processing