跳到主要导航 跳到搜索 跳到主要内容

F-T-LSTM based complex network for joint acoustic echo cancellation and speech enhancement

  • Shimin Zhang
  • , Yuxiang Kong
  • , Shubo Lv
  • , Yanxin Hu
  • , Lei Xie

科研成果: 书/报告/会议事项章节会议稿件同行评审

11 引用 (Scopus)

摘要

With the increasing demand for audio communication and online conference, ensuring the robustness of Acoustic Echo Cancellation (AEC) under the complicated acoustic scenario including noise, reverberation and nonlinear distortion has become a top issue. Although there have been some traditional methods that consider nonlinear distortion, they are still inefficient for echo suppression and the performance will be attenuated when noise is present. In this paper, we present a real-time AEC approach using complex neural network to better modeling the important phase information and frequency-time-LSTMs (F-TLSTM), which scan both frequency and time axis, for better temporal modeling. Moreover, we utilize modified SI-SNR as cost function to make the model to have better echo cancellation and noise suppression (NS) performance. With only 1.4M parameters, the proposed approach outperforms the AECchallenge baseline by 0.27 in terms of Mean Opinion Score (MOS).

源语言英语
主期刊名22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021
出版商International Speech Communication Association
791-795
页数5
ISBN(电子版)9781713836902
DOI
出版状态已出版 - 2021
活动22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021 - Brno, 捷克共和国
期限: 30 8月 20213 9月 2021

出版系列

姓名Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
2
ISSN(印刷版)2308-457X
ISSN(电子版)2958-1796

会议

会议22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021
国家/地区捷克共和国
Brno
时期30/08/213/09/21

指纹

探究 'F-T-LSTM based complex network for joint acoustic echo cancellation and speech enhancement' 的科研主题。它们共同构成独一无二的指纹。

引用此