跳到主要导航 跳到搜索 跳到主要内容

AUC Optimization for Deep Learning Based Voice Activity Detection

  • Zi Chen Fan
  • , Zhongxin Bai
  • , Xiao Lei Zhang
  • , Susanto Rahardja
  • , Jingdong Chen

科研成果: 书/报告/会议事项章节会议稿件同行评审

22 引用 (Scopus)

摘要

Voice activity detection (VAD) based on deep neural networks (DNN) has demonstrated good performance in adverse acoustic environments. Current DNN based VAD optimizes a surrogate function, e.g. minimum cross-entropy or minimum squared error, at a given decision threshold. However, VAD usually works on-the-fly with a dynamic decision threshold; and ROC curve is a global evaluation metric of VAD that reflects the performance of VAD at all possible decision thresholds. In this paper, we propose to optimize the area under ROC curve (AUC) by DNN, which can maximize the performance of VAD in terms of the ROC curve. Experimental results show that optimizing AUC by DNN results in higher performance than the common method of optimizing the minimum squared error by DNN.

源语言英语
主期刊名2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 - Proceedings
出版商Institute of Electrical and Electronics Engineers Inc.
6760-6764
页数5
ISBN(电子版)9781479981311
DOI
出版状态已出版 - 5月 2019
活动44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 - Brighton, 英国
期限: 12 5月 201917 5月 2019

出版系列

姓名ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
2019-May
ISSN(印刷版)1520-6149

会议

会议44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019
国家/地区英国
Brighton
时期12/05/1917/05/19

指纹

探究 'AUC Optimization for Deep Learning Based Voice Activity Detection' 的科研主题。它们共同构成独一无二的指纹。

引用此