Acoustic Source Localization Based on Geometric Projection in Reverberant and Noisy Environments

Tao Long, Jingdong Chen, Gongping Huang, Jacob Benesty, Israel Cohen

Research output: Contribution to journalArticlepeer-review

26 Scopus citations

Abstract

Acoustic source localization (ASL) is a fundamental yet still challenging signal processing problem in sound acquisition, speech communication, and human-machine interfaces. Many ASL algorithms have been developed, such as the steered response power (SRP), the SRP-phase transform, the minimum variance distortionless response, the multiple signal classification (MUSIC), the householder transform-based methods, to name but a few. Most of those algorithms require hundreds or even thousands of snapshots to produce one reliable estimate, which make them difficult to track moving sources. Moreover, not much efforts have been reported in the literature to show the intrinsic relationships among those methods. This paper deals with the ASL problem with its focal point placed on how to achieve ASL with a short frame of acoustic signal (corresponding to a single snapshot in the frequency domain). It reformulates the ASL problem from the perspective of geometric projection. Four types of power functions are proposed, leading to several different algorithms for ASL. By analyzing those power functions, we show the equivalence between the popularly used conventional algorithms and our methods, which provides some new insights into the conventional algorithms. The relationships among different algorithms are discussed, which make it easy to comprehend the pros and cons of each of those methods. Experiments in real acoustic environments corroborate the theoretical analysis, which in turn justifies the contribution of this paper.

Original languageEnglish
Article number8565913
Pages (from-to)143-155
Number of pages13
JournalIEEE Journal on Selected Topics in Signal Processing
Volume13
Issue number1
DOIs
StatePublished - Mar 2019

Keywords

  • Acoustic source localization
  • householder transform
  • minimum variance distortionless response (MVDR)
  • multiple signal classification (MUSIC)
  • phase transform
  • projection
  • steered response power

Fingerprint

Dive into the research topics of 'Acoustic Source Localization Based on Geometric Projection in Reverberant and Noisy Environments'. Together they form a unique fingerprint.

Cite this