Fri, Aug 11, 2017 @ 03:00 PM - 04:00 PM
Information Sciences Institute
Conferences, Lectures, & Seminars
Speaker: Nima Pourdamghani, USC/ISI
Talk Title: Improving machine translation from low resource languages
Series: Natural Language Seminar
Abstract: Statistical machine translation MT often needs a large corpus of parallel translated sentences in order to achieve good performance. This limits the use of current MT technologies to a few resource rich languages. Assume an incident happens in an area with a low-resource language. For a quick response, we need to build an MT system with available data, as finding or translating new parallel data is expensive and time consuming. For many languages this means that we only have a small amount of often out-of-domain parallel data e.g. a Bible or Ubuntu manual. This talk is about ways to improve machine translation in low resource scenarios. I'll talk about use of monolingual data and parallel data from related languages to improve machine translation from the low resource language into English.
Biography: Nima Pourdamghani is a fourth year Ph.D. student at ISI. He works with Professor Kevin Knight on machine translation from low resource languages.
Host: Nima Pourdamghani
Audiences: Everyone Is Invited
Contact: Peter Zamar