A STEERED RESPONSE POWER APPROACH WITH BILINEAR PREDICTION-BASED TRADE-OFF PREWHITENING FOR SPEAKER LOCALIZATION

Zhiheng Wang; Hongsen He; Jingdong Chen; Jacob Benesty; Yi Yu

doi:10.1109/ICASSP48485.2024.10448270

A STEERED RESPONSE POWER APPROACH WITH BILINEAR PREDICTION-BASED TRADE-OFF PREWHITENING FOR SPEAKER LOCALIZATION

Zhiheng Wang, Hongsen He, Jingdong Chen, Jacob Benesty, Yi Yu

School of Marine Science and Technology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

2 Scopus citations

Abstract

This paper studies the problem of acoustic source localization in room environments. It presents an improved steered response power (SRP) approach with low-complexity and trade-off prewhitening. This method consists of two steps. In the first one, the linear predictor that is used to model the speech signals is formulated as a bilinear form, and a group of convex-constrained linear prediction sub-models with respect to dual sub-predictors are established to pre-filter microphone signals. The pre-filtered (prewhitened) microphone signals are subsequently used in SRP for speaker localization. Simulation results demonstrate the properties of the presented method: it is robust to reverberation and noise, and is computationally efficient thanks to the bilinear form.

Original language	English
Title of host publication	2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Proceedings
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	1046-1050
Number of pages	5
ISBN (Electronic)	9798350344851
DOIs	https://doi.org/10.1109/ICASSP48485.2024.10448270
State	Published - 2024
Event	2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Seoul, Korea, Republic of Duration: 14 Apr 2024 → 19 Apr 2024

Publication series

Name	ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN (Print)	1520-6149

Conference

Conference	2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024
Country/Territory	Korea, Republic of
City	Seoul
Period	14/04/24 → 19/04/24

Keywords

Acoustic source localization
bilinear forms
linear prediction
trade-off prewhitening

Access to Document

10.1109/ICASSP48485.2024.10448270

Cite this

Wang, Z., He, H., Chen, J., Benesty, J., & Yu, Y. (2024). A STEERED RESPONSE POWER APPROACH WITH BILINEAR PREDICTION-BASED TRADE-OFF PREWHITENING FOR SPEAKER LOCALIZATION. In 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Proceedings (pp. 1046-1050). (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICASSP48485.2024.10448270

Wang, Zhiheng ; He, Hongsen ; Chen, Jingdong et al. / A STEERED RESPONSE POWER APPROACH WITH BILINEAR PREDICTION-BASED TRADE-OFF PREWHITENING FOR SPEAKER LOCALIZATION. 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Proceedings. Institute of Electrical and Electronics Engineers Inc., 2024. pp. 1046-1050 (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings).

@inproceedings{910498ed9bbe466fb9dcf0a7a4b42dff,

title = "A STEERED RESPONSE POWER APPROACH WITH BILINEAR PREDICTION-BASED TRADE-OFF PREWHITENING FOR SPEAKER LOCALIZATION",

abstract = "This paper studies the problem of acoustic source localization in room environments. It presents an improved steered response power (SRP) approach with low-complexity and trade-off prewhitening. This method consists of two steps. In the first one, the linear predictor that is used to model the speech signals is formulated as a bilinear form, and a group of convex-constrained linear prediction sub-models with respect to dual sub-predictors are established to pre-filter microphone signals. The pre-filtered (prewhitened) microphone signals are subsequently used in SRP for speaker localization. Simulation results demonstrate the properties of the presented method: it is robust to reverberation and noise, and is computationally efficient thanks to the bilinear form.",

keywords = "Acoustic source localization, bilinear forms, linear prediction, trade-off prewhitening",

author = "Zhiheng Wang and Hongsen He and Jingdong Chen and Jacob Benesty and Yi Yu",

note = "Publisher Copyright: {\textcopyright} 2024 IEEE.; 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 ; Conference date: 14-04-2024 Through 19-04-2024",

year = "2024",

doi = "10.1109/ICASSP48485.2024.10448270",

language = "英语",

series = "ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "1046--1050",

booktitle = "2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Proceedings",

}

Wang, Z, He, H, Chen, J, Benesty, J & Yu, Y 2024, A STEERED RESPONSE POWER APPROACH WITH BILINEAR PREDICTION-BASED TRADE-OFF PREWHITENING FOR SPEAKER LOCALIZATION. in 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Proceedings. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Institute of Electrical and Electronics Engineers Inc., pp. 1046-1050, 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024, Seoul, Korea, Republic of, 14/04/24. https://doi.org/10.1109/ICASSP48485.2024.10448270

A STEERED RESPONSE POWER APPROACH WITH BILINEAR PREDICTION-BASED TRADE-OFF PREWHITENING FOR SPEAKER LOCALIZATION. / Wang, Zhiheng; He, Hongsen; Chen, Jingdong et al.
2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Proceedings. Institute of Electrical and Electronics Engineers Inc., 2024. p. 1046-1050 (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - A STEERED RESPONSE POWER APPROACH WITH BILINEAR PREDICTION-BASED TRADE-OFF PREWHITENING FOR SPEAKER LOCALIZATION

AU - Wang, Zhiheng

AU - He, Hongsen

AU - Chen, Jingdong

AU - Benesty, Jacob

AU - Yu, Yi

PY - 2024

Y1 - 2024

N2 - This paper studies the problem of acoustic source localization in room environments. It presents an improved steered response power (SRP) approach with low-complexity and trade-off prewhitening. This method consists of two steps. In the first one, the linear predictor that is used to model the speech signals is formulated as a bilinear form, and a group of convex-constrained linear prediction sub-models with respect to dual sub-predictors are established to pre-filter microphone signals. The pre-filtered (prewhitened) microphone signals are subsequently used in SRP for speaker localization. Simulation results demonstrate the properties of the presented method: it is robust to reverberation and noise, and is computationally efficient thanks to the bilinear form.

AB - This paper studies the problem of acoustic source localization in room environments. It presents an improved steered response power (SRP) approach with low-complexity and trade-off prewhitening. This method consists of two steps. In the first one, the linear predictor that is used to model the speech signals is formulated as a bilinear form, and a group of convex-constrained linear prediction sub-models with respect to dual sub-predictors are established to pre-filter microphone signals. The pre-filtered (prewhitened) microphone signals are subsequently used in SRP for speaker localization. Simulation results demonstrate the properties of the presented method: it is robust to reverberation and noise, and is computationally efficient thanks to the bilinear form.

KW - Acoustic source localization

KW - bilinear forms

KW - linear prediction

KW - trade-off prewhitening

UR - http://www.scopus.com/inward/record.url?scp=85195395805&partnerID=8YFLogxK

U2 - 10.1109/ICASSP48485.2024.10448270

DO - 10.1109/ICASSP48485.2024.10448270

M3 - 会议稿件

AN - SCOPUS:85195395805

T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

SP - 1046

EP - 1050

BT - 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Proceedings

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024

Y2 - 14 April 2024 through 19 April 2024

ER -

Wang Z, He H, Chen J, Benesty J, Yu Y. A STEERED RESPONSE POWER APPROACH WITH BILINEAR PREDICTION-BASED TRADE-OFF PREWHITENING FOR SPEAKER LOCALIZATION. In 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Proceedings. Institute of Electrical and Electronics Engineers Inc. 2024. p. 1046-1050. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings). doi: 10.1109/ICASSP48485.2024.10448270

A STEERED RESPONSE POWER APPROACH WITH BILINEAR PREDICTION-BASED TRADE-OFF PREWHITENING FOR SPEAKER LOCALIZATION

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this