TY - GEN
T1 - Combined Weighted Prediction Error and Minimum Variance Distortionless Response for dereverberation
AU - Cohen, Alejandro
AU - Stemmer, Georg
AU - Ingalsuo, Seppo
AU - Markovich-Golan, Shmulik
N1 - Publisher Copyright: © 2017 IEEE.
PY - 2017/6/16
Y1 - 2017/6/16
N2 - Considering the dereverberation problem using multichannel processing, two main paradigms exist. The first paradigm utilizes the long-term correlation of the reverberant component for reducing it, e.g. Weighted Prediction Error (WPE) [1]. The second paradigm, treats the reverberation as a diffuse noise field, statically independent of the direct speech component, and aims to reduce it using a superdirective beamformer, e.g. [2]. Here we propose to combine the two paradigms in a two-stages algorithm. The first stage comprises of the WPE method, and the second stage comprises of a Minimum Variance Distortionless Response (MVDR) beamformer for treating the residual reverberant component. We conjecture that the coherence of the reverberant component at the output of the WPE is similar to the coherence of the reverberant component at the microphones which should theoretically correspond to a diffuse noise field. By estimating the coherence from the reverberant components, linearly predicted by the WPE, non-ideal factors such as microphone positions errors, non-equalized frequency responses and acoustic shading are accounted for. The advantageous performance of the proposed method is exemplified in an experiment study using simulations.
AB - Considering the dereverberation problem using multichannel processing, two main paradigms exist. The first paradigm utilizes the long-term correlation of the reverberant component for reducing it, e.g. Weighted Prediction Error (WPE) [1]. The second paradigm, treats the reverberation as a diffuse noise field, statically independent of the direct speech component, and aims to reduce it using a superdirective beamformer, e.g. [2]. Here we propose to combine the two paradigms in a two-stages algorithm. The first stage comprises of the WPE method, and the second stage comprises of a Minimum Variance Distortionless Response (MVDR) beamformer for treating the residual reverberant component. We conjecture that the coherence of the reverberant component at the output of the WPE is similar to the coherence of the reverberant component at the microphones which should theoretically correspond to a diffuse noise field. By estimating the coherence from the reverberant components, linearly predicted by the WPE, non-ideal factors such as microphone positions errors, non-equalized frequency responses and acoustic shading are accounted for. The advantageous performance of the proposed method is exemplified in an experiment study using simulations.
UR - http://www.scopus.com/inward/record.url?scp=85023758463&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2017.7952195
DO - 10.1109/ICASSP.2017.7952195
M3 - منشور من مؤتمر
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 446
EP - 450
BT - 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017
Y2 - 5 March 2017 through 9 March 2017
ER -