Robust multichannel TDOA estimation for speaker localization using the impulsive characteristics of speech spectrum

Hongsen He, Jingdong Chen, Jacob Benesty, Yingyue Zhou, Tao Yang

科研成果: 书/报告/会议事项章节会议稿件同行评审

13 引用 (Scopus)

摘要

Time delay estimation (TDE) plays an important role in localizing and tracking radiating acoustic sources. Although many efforts have been devoted to this problem in the literature, the robustness of TDE with respect to noise and reverberation remains a great challenge for practical systems. In this paper, we investigate the TDE problem in acoustic single-input/multiple-output (SIMO) systems in reverberant and noisy environments. We first define a Cauchy estimator in the frequency domain, which is robust in dealing with speech as the SIMO system's excitation. This robust estimator is then used to construct a cost function, from which a robust multichannel frequency-domain adaptive filter is deduced. This adaptive algorithm is subsequently employed to blindly identify the acoustic impulse responses between the source and the microphones. Finally, the time difference of arrival is determined from the identified channel responses.

源语言英语
主期刊名2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017 - Proceedings
出版商Institute of Electrical and Electronics Engineers Inc.
6130-6134
页数5
ISBN(电子版)9781509041176
DOI
出版状态已出版 - 16 6月 2017
活动2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017 - New Orleans, 美国
期限: 5 3月 20179 3月 2017

出版系列

姓名ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN(印刷版)1520-6149

会议

会议2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017
国家/地区美国
New Orleans
时期5/03/179/03/17

指纹

探究 'Robust multichannel TDOA estimation for speaker localization using the impulsive characteristics of speech spectrum' 的科研主题。它们共同构成独一无二的指纹。

引用此