Binaural localization of speech sources in the median plane using cepstral hrtf extraction

Dumidu S. Talagala, Xiang Wu, Wen Zhang, Thushara D. Abhayapala

科研成果: 书/报告/会议事项章节会议稿件同行评审

4 引用 (Scopus)

摘要

In binaural systems, source localization in the median plane is challenging due to the difficulty of exploring the spectral cues of the head-related transfer function (HRTF) independently of the source spectra. This paper presents a method of extracting the HRTF spectral cues using cepstral analysis for speech source localization in the median plane. Binaural signals are preprocessed in the cepstral domain so that the fine spectral structure of speech and the HRTF spectral envelope can be easily separated. We introduce (i) a truncated cepstral transformation to extract the relevant localization cues, and (ii) a mechanism to normalize the effects of the time varying speech spectra. The proposed method is evaluated and compared with a convolution based localization method using a speech corpus of multiple speakers. The results suggest that the proposed method fully exploits the available spectral cues for robust speaker independent binaural source localization in the median plane.

源语言英语
主期刊名2014 Proceedings of the 22nd European Signal Processing Conference, EUSIPCO 2014
出版商European Signal Processing Conference, EUSIPCO
2055-2059
页数5
ISBN(电子版)9780992862619
出版状态已出版 - 10 11月 2014
已对外发布
活动22nd European Signal Processing Conference, EUSIPCO 2014 - Lisbon, 葡萄牙
期限: 1 9月 20145 9月 2014

出版系列

姓名European Signal Processing Conference
ISSN(印刷版)2219-5491

会议

会议22nd European Signal Processing Conference, EUSIPCO 2014
国家/地区葡萄牙
Lisbon
时期1/09/145/09/14

指纹

探究 'Binaural localization of speech sources in the median plane using cepstral hrtf extraction' 的科研主题。它们共同构成独一无二的指纹。

引用此