
Visualize speech: A continuous speech recognition system for facial animation using acoustic visemes

  • Lei Xie
  • Dongmei Jiang
  • Ravyse Ilse
  • Rongchun Zhao
  • Verhelst Werner
  • Sahli Hichem
  • Conlenis Jan

Research output: Chapter in book/report/conference proceeding; conference contribution; peer-reviewed

Abstract

This paper presents an acoustic-viseme-based continuous speech recognition system for speech-driven talking face animation. The system is built on viseme HMMs and takes acoustic speech as its only input. Triseme HMMs are adopted to reflect mouth shape contexts. Visual decision trees are introduced to train the triseme HMM parameters robustly from limited training data. In the tree-building process, visual questions are designed using methods based on lip rounding and the similarity of viseme shapes. Objective and subjective evaluations show that talking face animation driven by the proposed speech recognition system outperforms the conventional phoneme-based one, and that visually relevant speech segmentation information can be obtained from the acoustic speech signal alone.

Original language: English
Title of host publication: Proceedings of 2003 International Conference on Neural Networks and Signal Processing, ICNNSP'03
Pages: 872-875
Number of pages: 4
Publication status: Published - 2003
Event: 2003 International Conference on Neural Networks and Signal Processing, ICNNSP'03 - Nanjing, China
Duration: 14 Dec 2003 to 17 Dec 2003

Publication series

Name: Proceedings of 2003 International Conference on Neural Networks and Signal Processing, ICNNSP'03
Volume: 2

Conference

Conference: 2003 International Conference on Neural Networks and Signal Processing, ICNNSP'03
Country/Territory: China
City: Nanjing
Period: 14/12/03 to 17/12/03
