Plug-and-Play MVDR Beamforming for Speech Separation

Chengbo Chang; Ziye Yang; Jie Chen

doi:10.1109/ICASSP48485.2024.10445739

Plug-and-Play MVDR Beamforming for Speech Separation

Chengbo Chang, Ziye Yang, Jie Chen

School of Marine Science and Technology

Northwestern Polytechnical University Xian

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

1 Scopus citations

Abstract

As an adaptive beamformer, the Minimum Variance Distortionless Response (MVDR) method has proven its efficiency in separating target speech from background noise and interference. Conventionally, MVDR relies on physical information regarding signal angles and covariance matrices, however, ignores that the beamformer output can potentially benefit from the prior structures of speech signals. Motivated by the recent advance in integrating physics-based and data-driven approaches, this paper introduces a novel speech separation framework. Our approach enhances MVDR by incorporating Plug-and-Play (PnP) techniques to capture speech priors, specifically employing the Regularization by Denoising (RED) method to integrate prior speech information obtained from data into the optimization process. Experimental results validate the effectiveness of the proposed approach.

Original language	English
Title of host publication	2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Proceedings
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	1346-1350
Number of pages	5
ISBN (Electronic)	9798350344851
DOIs	https://doi.org/10.1109/ICASSP48485.2024.10445739
State	Published - 2024
Event	2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Seoul, Korea, Republic of Duration: 14 Apr 2024 → 19 Apr 2024

Publication series

Name	ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN (Print)	1520-6149

Conference

Conference	2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024
Country/Territory	Korea, Republic of
City	Seoul
Period	14/04/24 → 19/04/24

Keywords

MVDR beamforming
PnP strategy
Speech separation
deep speech priors

Access to Document

10.1109/ICASSP48485.2024.10445739

Cite this

Chang, C., Yang, Z., & Chen, J. (2024). Plug-and-Play MVDR Beamforming for Speech Separation. In 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Proceedings (pp. 1346-1350). (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICASSP48485.2024.10445739

Chang, Chengbo ; Yang, Ziye ; Chen, Jie. / Plug-and-Play MVDR Beamforming for Speech Separation. 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Proceedings. Institute of Electrical and Electronics Engineers Inc., 2024. pp. 1346-1350 (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings).

@inproceedings{acfb38c09efd48f780966283cb253c78,

title = "Plug-and-Play MVDR Beamforming for Speech Separation",

abstract = "As an adaptive beamformer, the Minimum Variance Distortionless Response (MVDR) method has proven its efficiency in separating target speech from background noise and interference. Conventionally, MVDR relies on physical information regarding signal angles and covariance matrices, however, ignores that the beamformer output can potentially benefit from the prior structures of speech signals. Motivated by the recent advance in integrating physics-based and data-driven approaches, this paper introduces a novel speech separation framework. Our approach enhances MVDR by incorporating Plug-and-Play (PnP) techniques to capture speech priors, specifically employing the Regularization by Denoising (RED) method to integrate prior speech information obtained from data into the optimization process. Experimental results validate the effectiveness of the proposed approach.",

keywords = "MVDR beamforming, PnP strategy, Speech separation, deep speech priors",

author = "Chengbo Chang and Ziye Yang and Jie Chen",

note = "Publisher Copyright: {\textcopyright} 2024 IEEE.; 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 ; Conference date: 14-04-2024 Through 19-04-2024",

year = "2024",

doi = "10.1109/ICASSP48485.2024.10445739",

language = "英语",

series = "ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "1346--1350",

booktitle = "2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Proceedings",

}

Chang, C, Yang, Z & Chen, J 2024, Plug-and-Play MVDR Beamforming for Speech Separation. in 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Proceedings. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Institute of Electrical and Electronics Engineers Inc., pp. 1346-1350, 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024, Seoul, Korea, Republic of, 14/04/24. https://doi.org/10.1109/ICASSP48485.2024.10445739

Plug-and-Play MVDR Beamforming for Speech Separation. / Chang, Chengbo; Yang, Ziye; Chen, Jie.
2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Proceedings. Institute of Electrical and Electronics Engineers Inc., 2024. p. 1346-1350 (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Plug-and-Play MVDR Beamforming for Speech Separation

AU - Chang, Chengbo

AU - Yang, Ziye

AU - Chen, Jie

PY - 2024

Y1 - 2024

N2 - As an adaptive beamformer, the Minimum Variance Distortionless Response (MVDR) method has proven its efficiency in separating target speech from background noise and interference. Conventionally, MVDR relies on physical information regarding signal angles and covariance matrices, however, ignores that the beamformer output can potentially benefit from the prior structures of speech signals. Motivated by the recent advance in integrating physics-based and data-driven approaches, this paper introduces a novel speech separation framework. Our approach enhances MVDR by incorporating Plug-and-Play (PnP) techniques to capture speech priors, specifically employing the Regularization by Denoising (RED) method to integrate prior speech information obtained from data into the optimization process. Experimental results validate the effectiveness of the proposed approach.

AB - As an adaptive beamformer, the Minimum Variance Distortionless Response (MVDR) method has proven its efficiency in separating target speech from background noise and interference. Conventionally, MVDR relies on physical information regarding signal angles and covariance matrices, however, ignores that the beamformer output can potentially benefit from the prior structures of speech signals. Motivated by the recent advance in integrating physics-based and data-driven approaches, this paper introduces a novel speech separation framework. Our approach enhances MVDR by incorporating Plug-and-Play (PnP) techniques to capture speech priors, specifically employing the Regularization by Denoising (RED) method to integrate prior speech information obtained from data into the optimization process. Experimental results validate the effectiveness of the proposed approach.

KW - MVDR beamforming

KW - PnP strategy

KW - Speech separation

KW - deep speech priors

UR - http://www.scopus.com/inward/record.url?scp=85203881182&partnerID=8YFLogxK

U2 - 10.1109/ICASSP48485.2024.10445739

DO - 10.1109/ICASSP48485.2024.10445739

M3 - 会议稿件

AN - SCOPUS:85203881182

T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

SP - 1346

EP - 1350

BT - 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Proceedings

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024

Y2 - 14 April 2024 through 19 April 2024

ER -

Chang C, Yang Z, Chen J. Plug-and-Play MVDR Beamforming for Speech Separation. In 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Proceedings. Institute of Electrical and Electronics Engineers Inc. 2024. p. 1346-1350. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings). doi: 10.1109/ICASSP48485.2024.10445739

Plug-and-Play MVDR Beamforming for Speech Separation

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this