跳到主要导航 跳到搜索 跳到主要内容

Continuous speech recognition for large vocabulary based on triphone DBN model

  • Northwestern Polytechnical University Xian
  • Vrije Universiteit Brussel

科研成果: 期刊稿件文章同行评审

2 引用 (Scopus)

摘要

To avoid coarticulatory effects in continuous speech recognition, based on word-phone structure dynamic Bayesian network (WP-DBN) model and word-phone-state structure DBN (WPS-DBN) model, context-dependent triphone units are introduced. Two novel single stream DBN models, that is, word-triphone structure DBN (WT-DBN) and word-triphone-state structure DBN (WTS-DBN) models, are proposed for continuous speech recognition. WTS-DBN model is a triphone model and its modeling unit is triphone. It simulates a conventional HMM (hidden markov model) based triphone state-tying. Experimental results in large-vocabulary and clean speech environment show that the speech recognition rates of WTS-DBN model increase 20. 53%, 40.77%, 42.72% and 7.52% than those of the HMM, WT-DBN, WP-DBN and WPS-DBN models.

源语言英语
页(从-至)1-6
页数6
期刊Shuju Caiji Yu Chuli/Journal of Data Acquisition and Processing
24
1
出版状态已出版 - 1月 2009

指纹

探究 'Continuous speech recognition for large vocabulary based on triphone DBN model' 的科研主题。它们共同构成独一无二的指纹。

引用此