Multi-Speaker Direction of Arrival Estimation using SRP-PHAT Algorithm with a Weighted Histogram

Elior Hadad, Sharon Gannot

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

A direction of arrival (DOA) estimator for concurrent speakers in a reverberant environment is presented. The DOA estimation task is formulated in the short-Time Fourier transform (STFT) in two stages. In the first stage, a single narrow-band DOA per time-frequency (T-F) is selected, since the speech sources are assumed to exhibit disjoint activity in the STFT domain. The narrow-band DOA is obtained as the maximum of the narrow-band steered response power phase transform (SRP-PHAT) localization spectrum at that T-F bin. In addition, for each narrow-band DOA, a quality measure is calculated, which provides the confidence in the estimated decision. In the second stage, the wide-band localization spectrum is calculated using a weighted histogram of the narrow-band DOAs with the quality measures as weight. Finally, the wide-band DOA estimation is obtained by selecting the peaks in the wide-band localization spectrum. The results of our experimental study demonstrate the benefit of the proposed algorithm as compared to the wide-band SRP-PHAT algorithm in a reverberant environment.

Original languageEnglish
Title of host publication2018 IEEE International Conference on the Science of Electrical Engineering in Israel, ICSEE 2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781538663783
DOIs
StatePublished - 2 Jul 2018
Event2018 IEEE International Conference on the Science of Electrical Engineering in Israel, ICSEE 2018 - Eilat, Israel
Duration: 12 Dec 201814 Dec 2018

Publication series

Name2018 IEEE International Conference on the Science of Electrical Engineering in Israel, ICSEE 2018

Conference

Conference2018 IEEE International Conference on the Science of Electrical Engineering in Israel, ICSEE 2018
Country/TerritoryIsrael
CityEilat
Period12/12/1814/12/18

All Science Journal Classification (ASJC) codes

  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Multi-Speaker Direction of Arrival Estimation using SRP-PHAT Algorithm with a Weighted Histogram'. Together they form a unique fingerprint.

Cite this