Multi-microphone speech enhancement informed by auditory scene analysis

Axel Plinge, Sharon Gannot

פרסום מחקרי: פרק בספר / בדוח / בכנספרסום בספר כנסביקורת עמיתים

תקציר

A multitude of multi-microphone speech enhancement methods is available. In this paper, we focus our attention to the well-known minimum variance distortionless response (MVDR) beamformer, due to its ability to preserve distortionless response towards the desired speaker while minimizing the output noise power. We explore two alternatives for constructing the steering vectors towards the desired speech source. One is only using the direct path of the speech propagation in the form of delay-only filters, while the other is using the entire room impulse response (RIR). All beamforming methods requires some control information to be able to accomplish the task of enhancing a desired speech signal. In this paper, an acoustic event detection method using biologically-inspired features is employed. It can interpret the auditory scene by detecting the presence of different auditory objects. This is employed to control the estimation procedures used by beamformer. The resulting system provides a blind method of speech enhancement that can improve intelligibility independently of any additional information. Experiments with real recordings show the practical applicability of the method. Significant gain in fwSNRseg is achieved. Compared to using the direct path only, the use of the entire RIR proves beneficial.

שפה מקוריתאנגלית
כותר פרסום המארח2016 IEEE Sensor Array and Multichannel Signal Processing Workshop, SAM 2016
מוציא לאורIEEE Computer Society
מסת"ב (אלקטרוני)9781509021031
מזהי עצם דיגיטלי (DOIs)
סטטוס פרסוםפורסם - 15 ספט׳ 2016
אירוע2016 IEEE Sensor Array and Multichannel Signal Processing Workshop, SAM 2016 - Rio de Rio de Janeiro, ברזיל
משך הזמן: 10 יולי 201613 יולי 2016

סדרות פרסומים

שםProceedings of the IEEE Sensor Array and Multichannel Signal Processing Workshop
כרך2016-September

כנס

כנס2016 IEEE Sensor Array and Multichannel Signal Processing Workshop, SAM 2016
מדינה/אזורברזיל
עירRio de Rio de Janeiro
תקופה10/07/1613/07/16

ASJC Scopus subject areas

  • ???subjectarea.asjc.1700.1711???
  • ???subjectarea.asjc.2200.2207???
  • ???subjectarea.asjc.2200.2208???

טביעת אצבע

להלן מוצגים תחומי המחקר של הפרסום 'Multi-microphone speech enhancement informed by auditory scene analysis'. יחד הם יוצרים טביעת אצבע ייחודית.

פורמט ציטוט ביבליוגרפי