Acoustic Source Localization Based on Geometric Projection in Reverberant and Noisy Environments

Tao Long; Jingdong Chen; Gongping Huang; Jacob Benesty; Israel Cohen

doi:10.1109/JSTSP.2018.2885410

Acoustic Source Localization Based on Geometric Projection in Reverberant and Noisy Environments

Tao Long, Jingdong Chen, Gongping Huang, Jacob Benesty, Israel Cohen

School of Marine Science and Technology

Research output: Contribution to journal › Article › peer-review

26 Scopus citations

Abstract

Acoustic source localization (ASL) is a fundamental yet still challenging signal processing problem in sound acquisition, speech communication, and human-machine interfaces. Many ASL algorithms have been developed, such as the steered response power (SRP), the SRP-phase transform, the minimum variance distortionless response, the multiple signal classification (MUSIC), the householder transform-based methods, to name but a few. Most of those algorithms require hundreds or even thousands of snapshots to produce one reliable estimate, which make them difficult to track moving sources. Moreover, not much efforts have been reported in the literature to show the intrinsic relationships among those methods. This paper deals with the ASL problem with its focal point placed on how to achieve ASL with a short frame of acoustic signal (corresponding to a single snapshot in the frequency domain). It reformulates the ASL problem from the perspective of geometric projection. Four types of power functions are proposed, leading to several different algorithms for ASL. By analyzing those power functions, we show the equivalence between the popularly used conventional algorithms and our methods, which provides some new insights into the conventional algorithms. The relationships among different algorithms are discussed, which make it easy to comprehend the pros and cons of each of those methods. Experiments in real acoustic environments corroborate the theoretical analysis, which in turn justifies the contribution of this paper.

Original language	English
Article number	8565913
Pages (from-to)	143-155
Number of pages	13
Journal	IEEE Journal on Selected Topics in Signal Processing
Volume	13
Issue number	1
DOIs	https://doi.org/10.1109/JSTSP.2018.2885410
State	Published - Mar 2019

Keywords

Acoustic source localization
householder transform
minimum variance distortionless response (MVDR)
multiple signal classification (MUSIC)
phase transform
projection
steered response power

Access to Document

10.1109/JSTSP.2018.2885410

Cite this

@article{9e6386fa40b6455bb1c9dfa23bf69cc8,

title = "Acoustic Source Localization Based on Geometric Projection in Reverberant and Noisy Environments",

abstract = "Acoustic source localization (ASL) is a fundamental yet still challenging signal processing problem in sound acquisition, speech communication, and human-machine interfaces. Many ASL algorithms have been developed, such as the steered response power (SRP), the SRP-phase transform, the minimum variance distortionless response, the multiple signal classification (MUSIC), the householder transform-based methods, to name but a few. Most of those algorithms require hundreds or even thousands of snapshots to produce one reliable estimate, which make them difficult to track moving sources. Moreover, not much efforts have been reported in the literature to show the intrinsic relationships among those methods. This paper deals with the ASL problem with its focal point placed on how to achieve ASL with a short frame of acoustic signal (corresponding to a single snapshot in the frequency domain). It reformulates the ASL problem from the perspective of geometric projection. Four types of power functions are proposed, leading to several different algorithms for ASL. By analyzing those power functions, we show the equivalence between the popularly used conventional algorithms and our methods, which provides some new insights into the conventional algorithms. The relationships among different algorithms are discussed, which make it easy to comprehend the pros and cons of each of those methods. Experiments in real acoustic environments corroborate the theoretical analysis, which in turn justifies the contribution of this paper.",

keywords = "Acoustic source localization, householder transform, minimum variance distortionless response (MVDR), multiple signal classification (MUSIC), phase transform, projection, steered response power",

author = "Tao Long and Jingdong Chen and Gongping Huang and Jacob Benesty and Israel Cohen",

note = "Publisher Copyright: {\textcopyright} 2007-2012 IEEE.",

year = "2019",

month = mar,

doi = "10.1109/JSTSP.2018.2885410",

language = "英语",

volume = "13",

pages = "143--155",

journal = "IEEE Journal on Selected Topics in Signal Processing",

issn = "1932-4553",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "1",

}

TY - JOUR

T1 - Acoustic Source Localization Based on Geometric Projection in Reverberant and Noisy Environments

AU - Long, Tao

AU - Chen, Jingdong

AU - Huang, Gongping

AU - Benesty, Jacob

AU - Cohen, Israel

PY - 2019/3

Y1 - 2019/3

N2 - Acoustic source localization (ASL) is a fundamental yet still challenging signal processing problem in sound acquisition, speech communication, and human-machine interfaces. Many ASL algorithms have been developed, such as the steered response power (SRP), the SRP-phase transform, the minimum variance distortionless response, the multiple signal classification (MUSIC), the householder transform-based methods, to name but a few. Most of those algorithms require hundreds or even thousands of snapshots to produce one reliable estimate, which make them difficult to track moving sources. Moreover, not much efforts have been reported in the literature to show the intrinsic relationships among those methods. This paper deals with the ASL problem with its focal point placed on how to achieve ASL with a short frame of acoustic signal (corresponding to a single snapshot in the frequency domain). It reformulates the ASL problem from the perspective of geometric projection. Four types of power functions are proposed, leading to several different algorithms for ASL. By analyzing those power functions, we show the equivalence between the popularly used conventional algorithms and our methods, which provides some new insights into the conventional algorithms. The relationships among different algorithms are discussed, which make it easy to comprehend the pros and cons of each of those methods. Experiments in real acoustic environments corroborate the theoretical analysis, which in turn justifies the contribution of this paper.

AB - Acoustic source localization (ASL) is a fundamental yet still challenging signal processing problem in sound acquisition, speech communication, and human-machine interfaces. Many ASL algorithms have been developed, such as the steered response power (SRP), the SRP-phase transform, the minimum variance distortionless response, the multiple signal classification (MUSIC), the householder transform-based methods, to name but a few. Most of those algorithms require hundreds or even thousands of snapshots to produce one reliable estimate, which make them difficult to track moving sources. Moreover, not much efforts have been reported in the literature to show the intrinsic relationships among those methods. This paper deals with the ASL problem with its focal point placed on how to achieve ASL with a short frame of acoustic signal (corresponding to a single snapshot in the frequency domain). It reformulates the ASL problem from the perspective of geometric projection. Four types of power functions are proposed, leading to several different algorithms for ASL. By analyzing those power functions, we show the equivalence between the popularly used conventional algorithms and our methods, which provides some new insights into the conventional algorithms. The relationships among different algorithms are discussed, which make it easy to comprehend the pros and cons of each of those methods. Experiments in real acoustic environments corroborate the theoretical analysis, which in turn justifies the contribution of this paper.

KW - Acoustic source localization

KW - householder transform

KW - minimum variance distortionless response (MVDR)

KW - multiple signal classification (MUSIC)

KW - phase transform

KW - projection

KW - steered response power

UR - http://www.scopus.com/inward/record.url?scp=85058093107&partnerID=8YFLogxK

U2 - 10.1109/JSTSP.2018.2885410

DO - 10.1109/JSTSP.2018.2885410

M3 - 文章

AN - SCOPUS:85058093107

SN - 1932-4553

VL - 13

SP - 143

EP - 155

JO - IEEE Journal on Selected Topics in Signal Processing

JF - IEEE Journal on Selected Topics in Signal Processing

IS - 1

M1 - 8565913

ER -

Acoustic Source Localization Based on Geometric Projection in Reverberant and Noisy Environments

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this