Multichannel Speech Enhancement: From Wiener Filtering to Neural Networks By Dr. Ivan Tashev


Abstract: In this talk we will discuss the history and the current state of sound capture and speech enhancement. We will start with the general architecture of speech enhancement pipelines for hands-free telecommunication and distant speech recognition. The talk will cover both classical approaches based on statistical signal processing and deep learning approaches based on neural networks. It will be illustrated with real-life examples from the speech enhancement audio pipelines in Kinect, HoloLens, and Teams.



  Date and Time

  • Date: 12 May 2023
  • Time: 06:00 PM to 07:00 PM
  • All times are (UTC-06:00) Guadalajara

  Registration

  • Starts 31 March 2023 12:53 PM
  • Ends 12 May 2023 08:53 PM
  • All times are (UTC-06:00) Guadalajara
  • No Admission Charge


  Speakers

Dr. Ivan Tashev of Microsoft

Topic:

Multichannel Speech Enhancement: From Wiener Filtering to Neural Networks

Biography:

Dr. Ivan Tashev is a Partner Software Architect at Microsoft Research (MSR), Redmond, WA, USA, where he leads the Audio and Acoustics Research Group. His interests include multichannel signal processing, and machine learning and artificial intelligence for signal processing. He also coordinates the Brain-Computer Interfaces project in MSR. Dr. Tashev has published two books, two book chapters, and more than 100 scientific papers, and he is listed as an inventor on 50 US patents. He is an affiliate professor in the Department of Electrical and Computer Engineering at the University of Washington in Seattle, USA, and an honorary professor at the Technical University of Sofia, Bulgaria. Technologies created by Dr. Tashev are incorporated in many Microsoft products; he served as the audio architect for Kinect and for HoloLens. He is an IEEE Fellow and a member of the AES and the ASA.





Joint IEEE Signal Processing & Circuits and Systems Guadalajara Chapter