Multi-Speaker DOA Tracking Algorithm Utilizing Probability Hypothesis Density Filter and Weighted Histogram of SRP-PHAT

Yosef Soussana, Elior Hadad, Sharon Gannot

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This contribution presents a concurrent speakers' direction of arrival (DOA) tracking algorithm in reverberant environments. The algorithm is formulated in two stages, leveraging speech sparsity in the short-time Fourier transform (STFT) domain. In the first stage, sets of DOAs per batch of time frames are computed. Initially, a single narrow-band (NB) DOA per time-frequency (TF) bin is selected using the W-disjoint orthogonality property of speech. The NB DOA is obtained as the maximum of the steered response power phase transform (SRP-PHAT) localization spectrum at that TF bin, together with a quality measure describing the confidence in the estimation. A localization spectrum is obtained by combining the NB DOAs using a weighted histogram, with the quality measures serving as weights. The set of DOAs is determined by identifying peaks in the resulting localization spectrum. The collection of DOAs is modeled as a random finite set (RFS). In the second stage, the probability hypothesis density (PHD) filter is applied to estimate and track the speakers' DOAs over a collection of batches. Information from the first stage is utilized to calculate prior knowledge on the appearance of new speakers. Our experimental study demonstrates the superiority of the proposed algorithm over a baseline approach.

Original languageEnglish
Title of host publication2024 18th International Workshop on Acoustic Signal Enhancement, IWAENC 2024 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages334-338
Number of pages5
ISBN (Electronic)9798350361858
DOIs
StatePublished - 2024
Event18th International Workshop on Acoustic Signal Enhancement, IWAENC 2024 - Aalborg, Denmark
Duration: 9 Sep 202412 Sep 2024

Publication series

Name2024 18th International Workshop on Acoustic Signal Enhancement, IWAENC 2024 - Proceedings

Conference

Conference18th International Workshop on Acoustic Signal Enhancement, IWAENC 2024
Country/TerritoryDenmark
CityAalborg
Period9/09/2412/09/24

Bibliographical note

Publisher Copyright:
© 2024 IEEE.

Fingerprint

Dive into the research topics of 'Multi-Speaker DOA Tracking Algorithm Utilizing Probability Hypothesis Density Filter and Weighted Histogram of SRP-PHAT'. Together they form a unique fingerprint.

Cite this