A unified beamforming and source separation model for static and dynamic human-robot interaction

Jorge Wuth, Rodrigo Mahu, Israel Cohen, Richard M. Stern, Néstor Becerra Yoma

Research output: Contribution to journalArticlepeer-review

Abstract

This paper presents a unified model for combining beamforming and blind source separation (BSS). The validity of the model's assumptions is confirmed by recovering target speech information in noise accurately using Oracle information. Using real static human-robot interaction (HRI) data, the proposed combination of BSS with the minimum-variance distortionless response beamformer provides a greater signal-to-noise ratio (SNR) than previous parallel and cascade systems that combine BSS and beamforming. In the difficult-to-model HRI dynamic environment, the system provides a SNR gain that was 2.8 dB greater than the results obtained with the cascade combination, where the parallel combination is infeasible.

Original languageEnglish
Article number035203
JournalJASA Express Letters
Volume4
Issue number3
DOIs
StatePublished - 1 Mar 2024

All Science Journal Classification (ASJC) codes

  • Music
  • Arts and Humanities (miscellaneous)
  • Acoustics and Ultrasonics

Fingerprint

Dive into the research topics of 'A unified beamforming and source separation model for static and dynamic human-robot interaction'. Together they form a unique fingerprint.

Cite this