Deep-Learning Framework for Efficient Real-Time Speech Enhancement and Dereverberation

Tomer Rosenbaum, Emil Winebrand, Omer Cohen, Israel Cohen

Research output: Contribution to journalArticlepeer-review

Abstract

Deep learning has revolutionized speech enhancement, enabling impressive high-quality noise reduction and dereverberation. However, state-of-the-art methods often demand substantial computational resources, hindering their deployment on edge devices and in real-time applications. Computationally efficient approaches like deep filtering and Deep Filter Net offer an attractive alternative by predicting linear filters instead of directly estimating the clean speech. While Deep Filter Net excels in noise reduction, its dereverberation performance remains limited. In this paper, we present a generalized framework for computationally efficient speech enhancement and, based on this framework, identify an inherent constraint within Deep Filter Net that hinders its dereverberation capabilities. We propose an extension to the Deep Filter Net framework designed to overcome this limitation, demonstrating significant improvements in dereverberation performance while maintaining competitive noise-reduction quality. Our experimental results highlight the potential of this enhanced framework for real-time speech enhancement on resource-constrained devices.

Original languageEnglish
Article number630
JournalSensors
Volume25
Issue number3
DOIs
StatePublished - Feb 2025

Keywords

  • deep filtering
  • real-time processing
  • speech dereverberation
  • speech enhancement

All Science Journal Classification (ASJC) codes

  • Analytical Chemistry
  • Information Systems
  • Atomic and Molecular Physics, and Optics
  • Biochemistry
  • Instrumentation
  • Electrical and Electronic Engineering

Cite this