Single Channel Speech Enhancement: From Wiener Filtering to Neural Networks
In this talk we will discuss the history and the modern state of sound capturing and speech enhancement. We will start with the general architecture of speech enhancement pipelines for the needs of hands-free telecommunication and distant speech recognition. The talk will discuss both classical approaches using statistical signal processing and deep learning using neural networks. It will be illustrated with real-life examples from the speech enhancement audio pipelines in Kinect, HoloLens, and Teams.
Date and Time
Location
Hosts
Registration
- Date: 29 Mar 2023
- Time: 12:00 PM to 01:00 PM
- All times are (GMT-05:00) US/Eastern
-
Add Event to Calendar
- Contact Event Hosts
- Co-sponsored by Fairleigh Dickinson University
- Starts 02 March 2023 08:00 PM
- Ends 29 March 2023 01:00 PM
- All times are (GMT-05:00) US/Eastern
- No Admission Charge
Speakers
Dr. Ivan Tashev of Microosoft Research, WA, USA
Single Channel Speech Enhancement: From Wiener Filtering to Neural Networks
In this talk we will discuss the history and the modern state of sound capturing and speech enhancement. We will start with the general architecture of speech enhancement pipelines for the needs of hands-free telecommunication and distant speech recognition. The talk will discuss both classical approaches using statistical signal processing and deep learning using neural networks. It will be illustrated with real-life examples from the speech enhancement audio pipelines in Kinect, HoloLens, and Teams.
Biography:
Dr. Ivan Tashev is a Partner Software Architect in Microsoft Research (MSR), Redmond, WA, USA, where he leads the Audio and Acoustics Group. His interests include multichannel signal processing using machine learning and AI approaches. Ivan Tashev also coordinates the Brain-Computer Interfaces project in MSR. Dr. Tashev is affiliate professor in the Department for Electrical and Computer Engineering of University of Washington in Seattle, USA, and honorary professor at Technical University of Sofia, Bulgaria. Technologies created by Ivan Tashev are incorporated in many Microsoft products, he served as the audio architect for Kinect and for HoloLens. He is an IEEE Fellow, member of AES and ASA. More details about him can be found in his web page https://www.microsoft.com/en-us/research/people/ivantash/
Address:United States
Agenda
In this talk we will discuss the history and the modern state of sound capturing and speech enhancement. We will start with the general architecture of speech enhancement pipelines for the needs of hands-free telecommunication and distant speech recognition. The talk will discuss both classical approaches using statistical signal processing and deep learning using neural networks. It will be illustrated with real-life examples from the speech enhancement audio pipelines in Kinect, HoloLens, and Teams.