TY - GEN
T1 - Distributed Weighted Prediction Error with Node Selection Strategy for Speech Dereverberation
AU - Chen, Jie
AU - Yang, Ziye
AU - Rahardja, Susanto
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Speech dereverberation seeks to remove late reflections that smear the temporal and spectral structure of far-field recordings. Although the weighted prediction error (WPE) method has achieved promising performance, its centralized architecture incurs prohibitive computational and communication overhead in distributed scenarios where microphones are spatially dispersed and connected to resource-constrained processors. In this paper, we first formulate a distributed WPE optimization that incorporates an envelope-variance (EV)-based node selection module to exclude low-quality nodes and focus cooperation on the reliable observations. We then enhance the per-node optimization by integrating deep speech priors via Regularization-by-Denoising (RED), leveraging a pretrained deep neural network (DNN) denoiser as a proximal operator. Thus, the proposed framework not only balances computational load and reduces inter-node bandwidth by exchanging compressed signal, but also yields substantial dereverberation gains in challenging acoustic and noisy environments. Experiments under both simulated and real-world conditions confirm the superiority of the proposed method.
AB - Speech dereverberation seeks to remove late reflections that smear the temporal and spectral structure of far-field recordings. Although the weighted prediction error (WPE) method has achieved promising performance, its centralized architecture incurs prohibitive computational and communication overhead in distributed scenarios where microphones are spatially dispersed and connected to resource-constrained processors. In this paper, we first formulate a distributed WPE optimization that incorporates an envelope-variance (EV)-based node selection module to exclude low-quality nodes and focus cooperation on the reliable observations. We then enhance the per-node optimization by integrating deep speech priors via Regularization-by-Denoising (RED), leveraging a pretrained deep neural network (DNN) denoiser as a proximal operator. Thus, the proposed framework not only balances computational load and reduces inter-node bandwidth by exchanging compressed signal, but also yields substantial dereverberation gains in challenging acoustic and noisy environments. Experiments under both simulated and real-world conditions confirm the superiority of the proposed method.
KW - deep speech priors
KW - Distributed speech dereverberation
KW - node selection
KW - the weighted prediction error
UR - https://www.scopus.com/pages/publications/105018471545
U2 - 10.1109/ICCIT65724.2025.11167402
DO - 10.1109/ICCIT65724.2025.11167402
M3 - 会议稿件
AN - SCOPUS:105018471545
T3 - Proceeding - 2025 4th International Conference on Creative Communication and Innovative Technology: Empowering Transformative MATURE LEADERSHIP: Harnessing Technological Advancement for Global Sustainability, ICCIT 2025
BT - Proceeding - 2025 4th International Conference on Creative Communication and Innovative Technology
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 4th International Conference on Creative Communication and Innovative Technology, ICCIT 2025
Y2 - 15 August 2025 through 16 August 2025
ER -