Single Channel Speech Enhancement: From Wiener Filtering to Neural Networks

#"Single #Channel #Speech #Enhancement: #From #Wiener #Filtering #to #Neural #Networks" #by #Dr. #Ivan #Tashev #Software #Architect #from #Microsoft #Research.
Share

In this talk we will discuss the history and the modern state of sound capturing and speech enhancement. We will start with the general architecture of speech enhancement pipelines for the needs of hands-free telecommunication and distant speech recognition. The talk will discuss both classical approaches using statistical signal processing and deep learning using neural networks. It will be illustrated with real-life examples from the speech enhancement audio pipelines in Kinect, HoloLens, and Teams.



  Date and Time

  Location

  Hosts

  Registration



  • Date: 29 Mar 2023
  • Time: 12:00 PM to 01:00 PM
  • All times are (GMT-05:00) US/Eastern
  • Add_To_Calendar_icon Add Event to Calendar
If you are not a robot, please complete the ReCAPTCHA to display virtual attendance info.
  • Contact Event Hosts
  • Co-sponsored by Fairleigh Dickinson University
  • Starts 02 March 2023 08:00 PM
  • Ends 29 March 2023 01:00 PM
  • All times are (GMT-05:00) US/Eastern
  • No Admission Charge


  Speakers

Dr. Ivan Tashev Dr. Ivan Tashev of Microosoft Research, WA, USA

Topic:

Single Channel Speech Enhancement: From Wiener Filtering to Neural Networks

In this talk we will discuss the history and the modern state of sound capturing and speech enhancement. We will start with the general architecture of speech enhancement pipelines for the needs of hands-free telecommunication and distant speech recognition. The talk will discuss both classical approaches using statistical signal processing and deep learning using neural networks. It will be illustrated with real-life examples from the speech enhancement audio pipelines in Kinect, HoloLens, and Teams.

Biography:

Dr. Ivan Tashev is a Partner Software Architect in Microsoft Research (MSR), Redmond, WA, USA, where he leads the Audio and Acoustics Group. His interests include multichannel signal processing using machine learning and AI approaches. Ivan Tashev also coordinates the Brain-Computer Interfaces project in MSR. Dr. Tashev is affiliate professor in the Department for Electrical and Computer Engineering of University of Washington in Seattle, USA, and honorary professor at Technical University of Sofia, Bulgaria. Technologies created by Ivan Tashev are incorporated in many Microsoft products, he served as the audio architect for Kinect and for HoloLens. He is an IEEE Fellow, member of AES and ASA. More details about him can be found in his web page https://www.microsoft.com/en-us/research/people/ivantash/

Address:United States





Agenda

In this talk we will discuss the history and the modern state of sound capturing and speech enhancement. We will start with the general architecture of speech enhancement pipelines for the needs of hands-free telecommunication and distant speech recognition. The talk will discuss both classical approaches using statistical signal processing and deep learning using neural networks. It will be illustrated with real-life examples from the speech enhancement audio pipelines in Kinect, HoloLens, and Teams.