Attention-based Dual Stream Interactive Network for Nonlinear Residual Echo Suppression

Kai Xie, Ziye Yang, Jie Chen, Junjie Li

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Acoustic echo cancellation systems commonly employ linear adaptive filters to identify speaker-to-microphone echo paths. However, accurately estimating the echo path is challenging due to the nonlinear relationship between the far-end signal and the echo signal, leading to residual echo generation. Consequently, a post-suppression module is crucial for sufficient echo attenuation. Conventional deep learning-based methods for residual echo suppression (RES) rely on linear operations, such as addition and concatenation, disregarding the contextual information needed to effectively fuse error and auxiliary signal features. In this paper, we propose a novel end-to-end method for RES, which introduces an attentional fusion module that aggregates global and local contexts, as well as dynamically calculates the fusion weights for different signal features, enabling the neural network to efficiently leverage the correlation between these signals. Experimental results demonstrate the superiority of the proposed method.

Original languageEnglish
Title of host publication32nd European Signal Processing Conference, EUSIPCO 2024 - Proceedings
PublisherEuropean Signal Processing Conference, EUSIPCO
Pages221-225
Number of pages5
ISBN (Electronic)9789464593617
DOIs
StatePublished - 2024
Event32nd European Signal Processing Conference, EUSIPCO 2024 - Lyon, France
Duration: 26 Aug 202430 Aug 2024

Publication series

NameEuropean Signal Processing Conference
ISSN (Print)2219-5491

Conference

Conference32nd European Signal Processing Conference, EUSIPCO 2024
Country/TerritoryFrance
CityLyon
Period26/08/2430/08/24

Keywords

  • Residual echo suppression
  • attention based fusion
  • dual-stream
  • time-domain network

Fingerprint

Dive into the research topics of 'Attention-based Dual Stream Interactive Network for Nonlinear Residual Echo Suppression'. Together they form a unique fingerprint.

Cite this