On Generating synthetic voices for Low-Resource Languages

#synthetic #voices,resource #development,Large #databases #of #speech,linguistic #analysis #tools,Text #to #speech
Share

A lot of progress has been made in the field of speech synthesis in the last few years. Today's synthetic voices sound natural and are quite intelligible. A lot of this progress can be attributed to "resource development" in languages. Large databases of speech, linguistic analysis tools, pronunciation dictionaries, text processing rules, etc. can help make synthetic speech significantly better. However, this privileged "resource development" is only enjoyed by a few languages of the world. Text to speech (TTS) has become a common utility today. We now hear synthetic voices at airports, railway stations, in our GPS navigation devices, and even in toys. This phenomenon has led to an increasing demand for deploying TTS in localized languages. Most of these languages belong to the "low-resource" category. Although linguistic resource development work is happening at a fast pace for Indian languages, for many tasks, they still fall in the low resource category. In this talk, we will take a look at some challenges in building TTS systems for low resource languages. We will discuss in detail the challenge of inserting pauses between words (phrasing) in synthetic speech and look at novel data driven techniques that also work in low resource scenarios.



  Date and Time

  Location

  Hosts

  Registration



  • Add_To_Calendar_icon Add Event to Calendar
  • IIIT A, Jhalwa Campus
  • Deoghat
  • Allahabad, Uttar Pradesh
  • India 211012
  • Building: Auditorium

  • Contact Event Host
  • Co-sponsored by IEEE-RAS
  • Starts 01 August 2015 02:30 AM UTC
  • Ends 08 August 2015 05:30 PM UTC
  • No Admission Charge


  Speakers

ALOK PARLIKAR

Topic:

On Generating synthetic voices for Low-Resource Languages

A lot of progress has been made in the field of speech synthesis in the last few years. Today's synthetic voices sound natural and are quite intelligible. A lot of this progress can be attributed to "resource development" in languages. Large databases of speech, linguistic analysis tools, pronunciation dictionaries, text processing rules, etc. can help make synthetic speech significantly better. However, this privileged "resource development" is only enjoyed by a few languages of the world. Text to speech (TTS) has become a common utility today. We now hear synthetic voices at airports, railway stations, in our GPS navigation devices, and even in toys. This phenomenon has led to an increasing demand for deploying TTS in localized languages. Most of these languages belong to the "low-resource" category. Although linguistic resource development work is happening at a fast pace for Indian languages, for many tasks, they still fall in the low resource category. In this talk, we will take a look at some challenges in building TTS systems for low resource languages. We will discuss in detail the challenge of inserting pauses between words (phrasing) in synthetic speech and look at novel data driven techniques that also work in low resource scenarios.

Biography:

Dr Alok Parlikar graduated from the Premier Indian Institute of Information Technology, Allahabad with a gold medal. Subsequently he joined EPFL University , Switzerland for post graduate research before joining the computer science department of Carnegie Mellon University, USA from where he received his MS and Ph.D degree. He aslo worked at Amazon, USA for one year.