TY - JOUR
T1 - Acoustic Source Localization Based on Geometric Projection in Reverberant and Noisy Environments
AU - Long, Tao
AU - Chen, Jingdong
AU - Huang, Gongping
AU - Benesty, Jacob
AU - Cohen, Israel
N1 - Publisher Copyright:
© 2007-2012 IEEE.
PY - 2019/3
Y1 - 2019/3
N2 - Acoustic source localization (ASL) is a fundamental yet still challenging signal processing problem in sound acquisition, speech communication, and human-machine interfaces. Many ASL algorithms have been developed, such as the steered response power (SRP), the SRP-phase transform, the minimum variance distortionless response, the multiple signal classification (MUSIC), the householder transform-based methods, to name but a few. Most of those algorithms require hundreds or even thousands of snapshots to produce one reliable estimate, which make them difficult to track moving sources. Moreover, not much efforts have been reported in the literature to show the intrinsic relationships among those methods. This paper deals with the ASL problem with its focal point placed on how to achieve ASL with a short frame of acoustic signal (corresponding to a single snapshot in the frequency domain). It reformulates the ASL problem from the perspective of geometric projection. Four types of power functions are proposed, leading to several different algorithms for ASL. By analyzing those power functions, we show the equivalence between the popularly used conventional algorithms and our methods, which provides some new insights into the conventional algorithms. The relationships among different algorithms are discussed, which make it easy to comprehend the pros and cons of each of those methods. Experiments in real acoustic environments corroborate the theoretical analysis, which in turn justifies the contribution of this paper.
AB - Acoustic source localization (ASL) is a fundamental yet still challenging signal processing problem in sound acquisition, speech communication, and human-machine interfaces. Many ASL algorithms have been developed, such as the steered response power (SRP), the SRP-phase transform, the minimum variance distortionless response, the multiple signal classification (MUSIC), the householder transform-based methods, to name but a few. Most of those algorithms require hundreds or even thousands of snapshots to produce one reliable estimate, which make them difficult to track moving sources. Moreover, not much efforts have been reported in the literature to show the intrinsic relationships among those methods. This paper deals with the ASL problem with its focal point placed on how to achieve ASL with a short frame of acoustic signal (corresponding to a single snapshot in the frequency domain). It reformulates the ASL problem from the perspective of geometric projection. Four types of power functions are proposed, leading to several different algorithms for ASL. By analyzing those power functions, we show the equivalence between the popularly used conventional algorithms and our methods, which provides some new insights into the conventional algorithms. The relationships among different algorithms are discussed, which make it easy to comprehend the pros and cons of each of those methods. Experiments in real acoustic environments corroborate the theoretical analysis, which in turn justifies the contribution of this paper.
KW - Acoustic source localization
KW - householder transform
KW - minimum variance distortionless response (MVDR)
KW - multiple signal classification (MUSIC)
KW - phase transform
KW - projection
KW - steered response power
UR - http://www.scopus.com/inward/record.url?scp=85058093107&partnerID=8YFLogxK
U2 - 10.1109/JSTSP.2018.2885410
DO - 10.1109/JSTSP.2018.2885410
M3 - 文章
AN - SCOPUS:85058093107
SN - 1932-4553
VL - 13
SP - 143
EP - 155
JO - IEEE Journal on Selected Topics in Signal Processing
JF - IEEE Journal on Selected Topics in Signal Processing
IS - 1
M1 - 8565913
ER -