TY - GEN
T1 - Relative transfer function modeling for supervised source localization
AU - Laufer, Bracha
AU - Talmon, Ronen
AU - Gannot, Sharon
PY - 2013
Y1 - 2013
N2 - Speaker localization is one of the most prevalent problems in speech processing. Despite significant efforts in the last decades, high reverberation level still limits the performance of localization algorithms. Furthermore, using conventional localization methods, the information that can be extracted from dual microphone measurements is restricted to the time difference of arrival (TDOA). Under far-field regime, this is equivalent to either azimuth or elevation angles estimation. Full description of speaker's coordinates necessitates several microphones. In this contribution we tackle these two limitations by taking a manifold learning perspective for system identification. We present a training-based algorithm, motivated by the concept of diffusion maps, that aims at recovering the fundamental controlling parameters driving the measurements. This approach turns out to be more robust to reverberation, and capable of recovering the speech source location using merely two microphones signals.
AB - Speaker localization is one of the most prevalent problems in speech processing. Despite significant efforts in the last decades, high reverberation level still limits the performance of localization algorithms. Furthermore, using conventional localization methods, the information that can be extracted from dual microphone measurements is restricted to the time difference of arrival (TDOA). Under far-field regime, this is equivalent to either azimuth or elevation angles estimation. Full description of speaker's coordinates necessitates several microphones. In this contribution we tackle these two limitations by taking a manifold learning perspective for system identification. We present a training-based algorithm, motivated by the concept of diffusion maps, that aims at recovering the fundamental controlling parameters driving the measurements. This approach turns out to be more robust to reverberation, and capable of recovering the speech source location using merely two microphones signals.
KW - acoustic source localization
KW - diffusion kernel
KW - manifold learning
KW - relative transfer function
UR - http://www.scopus.com/inward/record.url?scp=84893578973&partnerID=8YFLogxK
U2 - 10.1109/waspaa.2013.6701829
DO - 10.1109/waspaa.2013.6701829
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:84893578973
SN - 9781479909728
T3 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics
BT - 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2013
T2 - 2013 14th IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2013
Y2 - 20 October 2013 through 23 October 2013
ER -