Binaural localization of speech sources in the median plane using cepstral hrtf extraction

Dumidu S. Talagala, Xiang Wu, Wen Zhang, Thushara D. Abhayapala

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

In binaural systems, source localization in the median plane is challenging due to the difficulty of exploring the spectral cues of the head-related transfer function (HRTF) independently of the source spectra. This paper presents a method of extracting the HRTF spectral cues using cepstral analysis for speech source localization in the median plane. Binaural signals are preprocessed in the cepstral domain so that the fine spectral structure of speech and the HRTF spectral envelope can be easily separated. We introduce (i) a truncated cepstral transformation to extract the relevant localization cues, and (ii) a mechanism to normalize the effects of the time varying speech spectra. The proposed method is evaluated and compared with a convolution based localization method using a speech corpus of multiple speakers. The results suggest that the proposed method fully exploits the available spectral cues for robust speaker independent binaural source localization in the median plane.

Original languageEnglish
Title of host publication2014 Proceedings of the 22nd European Signal Processing Conference, EUSIPCO 2014
PublisherEuropean Signal Processing Conference, EUSIPCO
Pages2055-2059
Number of pages5
ISBN (Electronic)9780992862619
StatePublished - 10 Nov 2014
Externally publishedYes
Event22nd European Signal Processing Conference, EUSIPCO 2014 - Lisbon, Portugal
Duration: 1 Sep 20145 Sep 2014

Publication series

NameEuropean Signal Processing Conference
ISSN (Print)2219-5491

Conference

Conference22nd European Signal Processing Conference, EUSIPCO 2014
Country/TerritoryPortugal
CityLisbon
Period1/09/145/09/14

Keywords

  • Binaural localization
  • cepstral transformation
  • head related transfer function (HRTF)
  • median plane

Fingerprint

Dive into the research topics of 'Binaural localization of speech sources in the median plane using cepstral hrtf extraction'. Together they form a unique fingerprint.

Cite this