Localization of an Unknown Number of Speakers in Adverse Acoustic Conditions Using Reliability Information and Diarization

Andreas Brendel, Bracha Laufer-Goldshtein, Sharon Gannot, Ronen Talmon, Walter Kellermann

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

10 Scopus citations

Abstract

This paper investigates localization of an arbitrary number of simultaneously active speakers in an acoustic enclosure. We propose an algorithm capable of estimating the number of speakers, using reliability information to obtain robust estimation results in adverse acoustic scenarios, and estimating individual probability distributions describing the position of each speaker using convex geometry tools. To this end, we start from an established algorithm for localization of acoustic sources based on the EM algorithm. There, the estimation of the number of sources and the handling of reverberation have not been addressed sufficiently. We show improvement in the localization of a higher number of sources and in robustness under adverse conditions, including interference from competing speakers, reverberation, and noise.

Original language: English
Title of host publication: 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 - Proceedings
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 7898-7902
Number of pages: 5
ISBN (Electronic): 9781479981311
State: Published - May 2019
Event: 44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 - Brighton, United Kingdom
Duration: 12 May 2019 - 17 May 2019

Publication series

Name: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume: 2019-May
ISSN (Print): 1520-6149

Conference

Conference: 44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019
Country/Territory: United Kingdom
City: Brighton
Period: 12/05/19 - 17/05/19

Bibliographical note

Publisher Copyright:
© 2019 IEEE.

Funding

This work was supported by DFG under contract no. Ke890/10-1 within the Research Unit FOR2457 "Acoustic Sensor Networks".

Funder: Deutsche Forschungsgemeinschaft
Funder number: Ke890/10-1

    Keywords

    • Acoustic source localization
    • EM algorithm
    • diarization
    • number of speakers estimation
