跳到主要导航 跳到搜索 跳到主要内容

Dual-microphone based binary mask estimation for robust speaker verification

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Missing feature theory (MFT) has shown great potential for robust speaker recognition in noisy environments. Accurate estimation of binary mask is crucial in MFT-based speaker recognition. This paper addresses the speaker verification problem using MFT in a practical scenario: the location of target speaker is fixed while the locations of noise interferences are unknown. Specifically, we propose a dual-microphone semi-blind approach to estimate the binary mask. During system initialization, a spatial location model for the target is trained precisely. Then a spatial model for corrupted speech is obtained on-line by model adaptation. Finally, the binary mask is estimated by likelihood comparison. Moreover, we propose a reliable frame selection method to further focus on the reliable speech frames for missing data speaker recognition. Experimental results demonstrate that our proposed approach achieves substantial improvements in recognition performance in both white noise and speech corrupted conditions.

源语言英语
主期刊名ICALIP 2012 - 2012 International Conference on Audio, Language and Image Processing, Proceedings
1014-1019
页数6
DOI
出版状态已出版 - 2012
活动2012 3rd IEEE/IET International Conference on Audio, Language and Image Processing, ICALIP 2012 - Shanghai, 中国
期限: 16 7月 201218 7月 2012

出版系列

姓名ICALIP 2012 - 2012 International Conference on Audio, Language and Image Processing, Proceedings

会议

会议2012 3rd IEEE/IET International Conference on Audio, Language and Image Processing, ICALIP 2012
国家/地区中国
Shanghai
时期16/07/1218/07/12

指纹

探究 'Dual-microphone based binary mask estimation for robust speaker verification' 的科研主题。它们共同构成独一无二的指纹。

引用此