跳到主要导航 跳到搜索 跳到主要内容

Wake word detection with alignment-free lattice-free MMI

  • Yiming Wang
  • , Hang Lv
  • , Daniel Povey
  • , Lei Xie
  • , Sanjeev Khudanpur
  • Johns Hopkins University
  • Northwestern Polytechnical University Xian
  • Xiaomi

科研成果: 书/报告/会议事项章节会议稿件同行评审

12 引用 (Scopus)

摘要

Always-on spoken language interfaces, e.g. personal digital assistants, rely on a wake word to start processing spoken input. We present novel methods to train a hybrid DNN/HMM wake word detection system from partially labeled training data, and to use it in on-line applications: (i) we remove the prerequisite of frame-level alignments in the LF-MMI training algorithm, permitting the use of un-transcribed training examples that are annotated only for the presence/absence of the wake word; (ii) we show that the classical keyword/filler model must be supplemented with an explicit non-speech (silence) model for good performance; (iii) we present an FST-based decoder to perform online detection. We evaluate our methods on two real data sets, showing 50%-90% reduction in false rejection rates at prespecified false alarm rates over the best previously published figures, and re-validate them on a third (large) data set.

源语言英语
主期刊名Interspeech 2020
出版商International Speech Communication Association
4258-4262
页数5
ISBN(印刷版)9781713820697
DOI
出版状态已出版 - 2020
活动21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020 - Shanghai, 中国
期限: 25 10月 202029 10月 2020

出版系列

姓名Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
2020-October
ISSN(印刷版)2308-457X
ISSN(电子版)1990-9772

会议

会议21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020
国家/地区中国
Shanghai
时期25/10/2029/10/20

指纹

探究 'Wake word detection with alignment-free lattice-free MMI' 的科研主题。它们共同构成独一无二的指纹。

引用此