TY - GEN
T1 - Attention-based Dual Stream Interactive Network for Nonlinear Residual Echo Suppression
AU - Xie, Kai
AU - Yang, Ziye
AU - Chen, Jie
AU - Li, Junjie
N1 - Publisher Copyright:
© 2024 European Signal Processing Conference, EUSIPCO. All rights reserved.
PY - 2024
Y1 - 2024
N2 - Acoustic echo cancellation systems commonly employ linear adaptive filters to identify speaker-to-microphone echo paths. However, accurately estimating the echo path is challenging due to the nonlinear relationship between the far-end signal and the echo signal, leading to residual echo generation. Consequently, a post-suppression module is crucial for sufficient echo attenuation. Conventional deep learning-based methods for residual echo suppression (RES) rely on linear operations, such as addition and concatenation, disregarding the contextual information needed to effectively fuse error and auxiliary signal features. In this paper, we propose a novel end-to-end method for RES, which introduces an attentional fusion module that aggregates global and local contexts, as well as dynamically calculates the fusion weights for different signal features, enabling the neural network to efficiently leverage the correlation between these signals. Experimental results demonstrate the superiority of the proposed method.
AB - Acoustic echo cancellation systems commonly employ linear adaptive filters to identify speaker-to-microphone echo paths. However, accurately estimating the echo path is challenging due to the nonlinear relationship between the far-end signal and the echo signal, leading to residual echo generation. Consequently, a post-suppression module is crucial for sufficient echo attenuation. Conventional deep learning-based methods for residual echo suppression (RES) rely on linear operations, such as addition and concatenation, disregarding the contextual information needed to effectively fuse error and auxiliary signal features. In this paper, we propose a novel end-to-end method for RES, which introduces an attentional fusion module that aggregates global and local contexts, as well as dynamically calculates the fusion weights for different signal features, enabling the neural network to efficiently leverage the correlation between these signals. Experimental results demonstrate the superiority of the proposed method.
KW - Residual echo suppression
KW - attention based fusion
KW - dual-stream
KW - time-domain network
UR - http://www.scopus.com/inward/record.url?scp=85208417421&partnerID=8YFLogxK
U2 - 10.23919/eusipco63174.2024.10715013
DO - 10.23919/eusipco63174.2024.10715013
M3 - 会议稿件
AN - SCOPUS:85208417421
T3 - European Signal Processing Conference
SP - 221
EP - 225
BT - 32nd European Signal Processing Conference, EUSIPCO 2024 - Proceedings
PB - European Signal Processing Conference, EUSIPCO
T2 - 32nd European Signal Processing Conference, EUSIPCO 2024
Y2 - 26 August 2024 through 30 August 2024
ER -