Multisensory speech enhancement in noisy environments using bone-conducted and air-conducted microphones

Mingzi Li, Israel Cohen, Saman Mousazadeh

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

In this paper, we propose a speech enhancement algorithm for estimating clean speech from samples of air-conducted and bone-conducted speech signals. We introduce a model in a supervised learning framework that approximates a mapping from the concatenation of noisy air-conducted and bone-conducted speech to clean speech in the short-time Fourier transform domain. Two function extension schemes are utilized: geometric harmonics and the Laplacian pyramid. The performance of the two schemes is evaluated and compared in terms of spectrograms and log spectral distance measures.
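The feature construction and the evaluation metric named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the frame length, hop size, Hann window, function names, and the particular log spectral distance formula (root-mean-square difference of dB spectra per frame, averaged over frames) are all assumptions chosen for illustration, and the paper's exact definitions may differ. The learned mapping itself (geometric harmonics or Laplacian pyramid) is not shown.

```python
import numpy as np


def stft_log_magnitude(signal, frame_len=256, hop=128, eps=1e-10):
    """Log-magnitude STFT frames, shape (n_frames, frame_len // 2 + 1).

    frame_len, hop and the Hann window are illustrative assumptions.
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    spectrum = np.fft.rfft(frames, axis=1)
    # eps avoids log(0) on silent bins
    return np.log(np.abs(spectrum) + eps)


def concatenate_features(ac_logmag, bc_logmag):
    """Stack air-conducted and bone-conducted features frame by frame."""
    return np.concatenate([ac_logmag, bc_logmag], axis=1)


def log_spectral_distance(clean_logmag, estimate_logmag):
    """Mean over frames of the RMS difference between dB spectra.

    Inputs are natural-log magnitudes; 20 / ln(10) converts them to dB.
    This is one common form of the metric, assumed here.
    """
    scale = 20.0 / np.log(10.0)
    diff_db = scale * (clean_logmag - estimate_logmag)
    return float(np.mean(np.sqrt(np.mean(diff_db ** 2, axis=1))))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Synthetic stand-ins for the two sensor signals (not real speech)
    air = rng.standard_normal(1024)
    bone = rng.standard_normal(1024)

    ac = stft_log_magnitude(air)
    bc = stft_log_magnitude(bone)
    features = concatenate_features(ac, bc)
    print("feature matrix shape:", features.shape)
    print("LSD between the two inputs (dB):", log_spectral_distance(ac, bc))
```

In a supervised setup of the kind described, each row of the concatenated feature matrix would serve as the input for one frame, and the corresponding clean-speech spectrum as the target; a lower log spectral distance between the estimate and the clean spectrum indicates a closer match.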

Original language: English
Title of host publication: 2014 IEEE China Summit and International Conference on Signal and Information Processing, IEEE ChinaSIP 2014 - Proceedings
Pages: 1-5
Number of pages: 5
ISBN (Electronic): 9781479954032
State: Published - 3 Sep 2014
Event: 2nd IEEE China Summit and International Conference on Signal and Information Processing, IEEE ChinaSIP 2014 - Xi'an, China
Duration: 9 Jul 2014 to 13 Jul 2014

Publication series

Name: 2014 IEEE China Summit and International Conference on Signal and Information Processing, IEEE ChinaSIP 2014 - Proceedings

Conference

Conference: 2nd IEEE China Summit and International Conference on Signal and Information Processing, IEEE ChinaSIP 2014
Country/Territory: China
City: Xi'an
Period: 9/07/14 to 13/07/14

Keywords

  • Laplacian pyramid
  • Multisensory
  • bone-conducted microphone
  • geometric harmonics

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Signal Processing
