A novel SM-DBN model for large-vocabulary continuous speech recognition and phone segmentation

Guoyun Lu, Dongmei Jiang, Yanning Zhang, Rongchun Zhao, Hichem Sahli

科研成果: 期刊稿件文章同行评审

1 引用 (Scopus)

摘要

A novel SM-DBN (Single-stream Multi-state Dynamic Bayesian Network) model is proposed. It is an augmentation of the Single Stream DBN Phone-shared (SS-DBN-P) model proposed by Bilmes et al[4] whose basic recognition units are words, to which we add an extra level of hidden nodes-states, resulting in the SM-DBN model. In our model, a word is composed of its corresponding phones, a phone is composed of a fixed number of states, and a state is associated with the observation features. Essentially, it is a phone model whose basic recognition units are phones. We perform the recognition and segmentation experiments with both continuous digital speech database and large-vocabulary speech database, with the experimental results given in Tables 1 through 3 in the full paper. The experimental results on large-vocabulary and clean speech environment show preliminarily that the speech recognition rate of SM-DBN model is 13.01% and 35% higher than those of the HMM (Hidden Markov Model) and the SS-DBN-P model respectively, and that its phone segmentation accuracy is respectively 10% and 44% higher than the other two models.

源语言英语
页(从-至)173-178
页数6
期刊Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University
26
2
出版状态已出版 - 4月 2008

指纹

探究 'A novel SM-DBN model for large-vocabulary continuous speech recognition and phone segmentation' 的科研主题。它们共同构成独一无二的指纹。

引用此