Semi-blind dual-microphone noise reduction with known target localization

Jian Zhang, Zhonghua Fu, Lei Xie, Yali Zhao

Research output: Contribution to journal › Article › peer-review

Abstract

Noise reduction is essential for practical speech recognition systems. In many applications the target speaker's location is fixed, but the type, number, and locations of the interfering sources are unknown and may even change over time. This paper presents a semi-blind dual-microphone noise reduction method for such conditions, based on the sparsity of speech in the time-frequency domain. The target speaker's location is assumed to be known and fixed, and is used to build a spatial location model of the target. The spatial location model of the unknown noise is then obtained by model adaptation, starting from the target speaker model. Next, every time-frequency bin of the mixed signals is classified as target or noise to build a binary mask. Finally, the target speech is re-synthesized using the binary mask. Tests show that this approach significantly reduces complex noise with little speech distortion, and that its performance is close to that of the non-blind degenerate unmixing estimation method.
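The bin classification and masking steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that each spatial location model reduces to a single inter-microphone delay, that the noise delay is already given (the paper instead obtains the noise model by adaptation from the target model), and that the two-channel spectrograms `X1` and `X2` have already been computed. The function name `binary_mask` and all parameter names are hypothetical.

```python
import numpy as np

def binary_mask(X1, X2, freqs, target_delay, noise_delay):
    """Label each time-frequency bin True (target) or False (noise).

    X1, X2: complex spectrograms of the two microphones, shape (n_freqs, n_frames).
    freqs: bin centre frequencies in Hz, shape (n_freqs,).
    target_delay, noise_delay: assumed inter-microphone delays in seconds.
    """
    # Observed inter-microphone phase difference in every bin.
    phase = np.angle(X2 * np.conj(X1))
    observed = np.exp(1j * phase)

    # Phase each assumed source location would produce at each frequency.
    w = 2 * np.pi * freqs[:, None]
    expected_target = np.exp(-1j * w * target_delay)
    expected_noise = np.exp(-1j * w * noise_delay)

    # Compare on the unit circle so that phase wrapping does not matter.
    d_target = np.abs(observed - expected_target)
    d_noise = np.abs(observed - expected_noise)
    return d_target < d_noise

def apply_mask(X1, mask):
    """Keep only the bins classified as target; the rest are zeroed."""
    return np.where(mask, X1, 0.0)
```

The masked spectrogram returned by `apply_mask` would then be passed to an inverse short-time Fourier transform to re-synthesize the target speech. A hard binary decision of this kind relies on the sparsity assumption stated above, namely that each bin is dominated by a single source.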

Original language: English
Pages (from-to): 1215-1219+1225
Journal: Qinghua Daxue Xuebao/Journal of Tsinghua University
Volume: 51
Issue number: 9
Publication status: Published - Sep 2011
