Continuous speech recognition for large vocabulary based on triphone DBN model

Guoyun Lv, Rongchun Zhao, Yanning Zhang, Yangyu Fan, Hichem Sahli

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

To avoid coarticulatory effects in continuous speech recognition, based on word-phone structure dynamic Bayesian network (WP-DBN) model and word-phone-state structure DBN (WPS-DBN) model, context-dependent triphone units are introduced. Two novel single stream DBN models, that is, word-triphone structure DBN (WT-DBN) and word-triphone-state structure DBN (WTS-DBN) models, are proposed for continuous speech recognition. WTS-DBN model is a triphone model and its modeling unit is triphone. It simulates a conventional HMM (hidden markov model) based triphone state-tying. Experimental results in large-vocabulary and clean speech environment show that the speech recognition rates of WTS-DBN model increase 20. 53%, 40.77%, 42.72% and 7.52% than those of the HMM, WT-DBN, WP-DBN and WPS-DBN models.

Original languageEnglish
Pages (from-to)1-6
Number of pages6
JournalShuju Caiji Yu Chuli/Journal of Data Acquisition and Processing
Volume24
Issue number1
StatePublished - Jan 2009

Keywords

  • Dynamic Bayesian network
  • Phone
  • Speech recognition
  • Triphone

Fingerprint

Dive into the research topics of 'Continuous speech recognition for large vocabulary based on triphone DBN model'. Together they form a unique fingerprint.

Cite this