Blind estimation of reverberation time using binaural complex ideal ratio mask

Ming Yang Chai, Tiantian Li, Mengyao Zhu, Tao Wang, Wen Zhang

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-reviewed

2 Citations (Scopus)

Abstract

Accurate estimation of the reverberation time T60 has been shown to benefit automatic speech recognition (ASR) in voice-controlled devices and the reconstruction of acoustic fields. Several algorithms for estimating T60 have been proposed recently. However, few of them directly use the spatial information about the acoustic environment that is contained in the speech signal. We propose a deep learning approach that treats T60 estimation as a regression problem, using binaural reverberant speech generated by convolving clean speech with simulated room impulse responses (RIRs). An adaptive cIRM estimator first estimates the complex ideal ratio mask (cIRM), which is strongly correlated with T60, and a CNN-based T60 estimator then estimates T60 from the cIRM. The experimental results show that the proposed approach outperforms the state-of-the-art T60 estimation method.
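The data generation step described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. It assumes a toy RIR model (white noise under an exponential envelope that decays by 60 dB at T60) in place of whatever room simulator the paper used, random noise as a stand-in for clean speech, and the common definition of the cIRM as the complex ratio of the clean STFT to the reverberant STFT. The names synth_rir, fft_convolve, stft and cirm are hypothetical helpers, and the adaptive cIRM estimator and the CNN regressor are not shown.

```python
import numpy as np

def synth_rir(t60, fs=16000, length_s=1.0, seed=0):
    """Toy RIR (assumption): white noise whose amplitude decays 60 dB in t60 seconds."""
    rng = np.random.default_rng(seed)
    t = np.arange(int(length_s * fs)) / fs
    envelope = 10.0 ** (-3.0 * t / t60)   # amplitude factor 1e-3 (-60 dB) at t = t60
    rir = rng.standard_normal(t.size) * envelope
    rir[0] = 1.0                          # direct path
    return rir

def fft_convolve(x, h):
    """Linear convolution via FFT, truncated to the length of x."""
    n = len(x) + len(h) - 1
    y = np.fft.irfft(np.fft.rfft(x, n) * np.fft.rfft(h, n), n)
    return y[:len(x)]

def stft(x, n_fft=512, hop=256):
    """Hann-windowed STFT, returned with shape (freq, time)."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack([x[i * hop:i * hop + n_fft] * window for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1).T

def cirm(clean, reverb, eps=1e-8):
    """Oracle complex ratio mask M = S / Y, computed as S * conj(Y) / |Y|^2."""
    S, Y = stft(clean), stft(reverb)
    return S * np.conj(Y) / (np.abs(Y) ** 2 + eps)

fs = 16000
clean = np.random.default_rng(1).standard_normal(fs)   # stand-in for 1 s of clean speech
# Two "ears": independent toy RIRs that share the same T60 of 0.5 s (assumption).
rirs = [synth_rir(0.5, fs, seed=s) for s in (10, 11)]
reverb = [fft_convolve(clean, h) for h in rirs]
masks = np.stack([cirm(clean, y) for y in reverb])      # (ear, freq, time), complex
# Real and imaginary parts stacked as channels, e.g. as input to a regressor.
features = np.concatenate([masks.real, masks.imag], axis=0)
print(features.shape)
```

Here the mask is computed from known clean speech, as one would for a training target. In the blind setting of the paper the mask has to be estimated from the reverberant signal alone.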

Original language: English
Title of host publication: Proceedings - 2019 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2019
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 378-383
Number of pages: 6
ISBN (Electronic): 9781538692141
Publication status: Published - Jul 2019
Event: 2019 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2019 - Shanghai, China
Duration: 8 Jul 2019 to 12 Jul 2019

Publication series

Name: Proceedings - 2019 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2019

Conference

Conference: 2019 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2019
Country/Territory: China
City: Shanghai
Period: 8/07/19 to 12/07/19
