Relative transfer function modeling for supervised source localization

Bracha Laufer, Ronen Talmon, Sharon Gannot

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

35 Scopus citations

Abstract

Speaker localization is one of the most prevalent problems in speech processing. Despite significant efforts in the last decades, high reverberation level still limits the performance of localization algorithms. Furthermore, using conventional localization methods, the information that can be extracted from dual microphone measurements is restricted to the time difference of arrival (TDOA). Under far-field regime, this is equivalent to either azimuth or elevation angles estimation. Full description of speaker's coordinates necessitates several microphones. In this contribution we tackle these two limitations by taking a manifold learning perspective for system identification. We present a training-based algorithm, motivated by the concept of diffusion maps, that aims at recovering the fundamental controlling parameters driving the measurements. This approach turns out to be more robust to reverberation, and capable of recovering the speech source location using merely two microphones signals.

Original languageEnglish
Title of host publication2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2013
DOIs
StatePublished - 2013
Event2013 14th IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2013 - New Paltz, NY, United States
Duration: 20 Oct 201323 Oct 2013

Publication series

NameIEEE Workshop on Applications of Signal Processing to Audio and Acoustics

Conference

Conference2013 14th IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2013
Country/TerritoryUnited States
CityNew Paltz, NY
Period20/10/1323/10/13

Keywords

  • acoustic source localization
  • diffusion kernel
  • manifold learning
  • relative transfer function

Fingerprint

Dive into the research topics of 'Relative transfer function modeling for supervised source localization'. Together they form a unique fingerprint.

Cite this