TY - GEN
T1 - LOW RESOURCES ONLINE SINGLE-MICROPHONE SPEECH ENHANCEMENT WITH HARMONIC EMPHASIS
AU - Raviv, Nir
AU - Schwartz, Ofer
AU - Gannot, Sharon
N1 - Publisher Copyright: © 2022 IEEE
PY - 2022
Y1 - 2022
N2 - In this paper, we propose a deep neural network (DNN)-based single-microphone speech enhancement algorithm characterized by a short latency and low computational resources. Many speech enhancement algorithms suffer from low noise reduction capabilities between pitch harmonics, and in severe cases, the harmonic structure may even be lost. Recognizing this drawback, we propose a new weighted loss that emphasizes pitch-dominated frequency bands. For that, we propose a method, applied only at the training stage, to detect these frequency bands. The proposed method is applied to speech signals contaminated by several noise types, and in particular, typical domestic noise drawn from ESC-50 and DEMAND databases, demonstrating its applicability to 'stay-at-home' scenarios.
AB - In this paper, we propose a deep neural network (DNN)-based single-microphone speech enhancement algorithm characterized by a short latency and low computational resources. Many speech enhancement algorithms suffer from low noise reduction capabilities between pitch harmonics, and in severe cases, the harmonic structure may even be lost. Recognizing this drawback, we propose a new weighted loss that emphasizes pitch-dominated frequency bands. For that, we propose a method, applied only at the training stage, to detect these frequency bands. The proposed method is applied to speech signals contaminated by several noise types, and in particular, typical domestic noise drawn from ESC-50 and DEMAND databases, demonstrating its applicability to 'stay-at-home' scenarios.
KW - DNN
KW - Ideal ratio mask
KW - Single-microphone speech enhancement
KW - Speech harmonics presence detection
UR - http://www.scopus.com/inward/record.url?scp=85134033346&partnerID=8YFLogxK
U2 - 10.1109/ICASSP43922.2022.9747656
DO - 10.1109/ICASSP43922.2022.9747656
M3 - منشور من مؤتمر
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 8807
EP - 8811
BT - 2022 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022
Y2 - 22 May 2022 through 27 May 2022
ER -