Triseme decision trees in the continuous speech recognition system for a talking head

  • Dong Mei Jiang
  • , Lei Xie
  • , Ilse Ravyse
  • , Rong Chun Zhao
  • , Hichem Sahli
  • , Jan Cornelis

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

In this paper, we present a viseme (the basic speech units in the visual domain) based continuous speech recognition system, which segments speech into viseme sequences with timing boundaries to drive a talking head. In the viseme Hidden Markov Model (HMM) training, the instances of a viseme with different contexts are formulated as trisemes. Based on the mouth shape parameters Liprounding and the defined viseme similarity weight (VSW) from the 3D viseme facial models, 166 questions concerning the viseme contexts are designed to build triseme decision trees to tie the states of the trisemes with similar contexts, so that they can share the same parameters. To evaluate the system performance, the image related measurements are also taken to evaluate the resulting viseme sequences, with 'jerky instances' in Liprounding and VSW graphs evaluating their smoothness. Results show that compared to the phoneme based system, the tied-state triseme based speech recognition system gives talking head animation with smoother and more plausible mouth shapes.

Original languageEnglish
Title of host publicationProceedings of 2002 International Conference on Machine Learning and Cybernetics
Pages2097-2101
Number of pages5
StatePublished - 2002
EventProceedings of 2002 International Conference on Machine Learning and Cybernetics - Beijing, China
Duration: 4 Nov 20025 Nov 2002

Publication series

NameProceedings of 2002 International Conference on Machine Learning and Cybernetics
Volume4

Conference

ConferenceProceedings of 2002 International Conference on Machine Learning and Cybernetics
Country/TerritoryChina
CityBeijing
Period4/11/025/11/02

Keywords

  • Jerky instances
  • Liprounding
  • Triseme decision tree
  • Viseme
  • Viseme similarity weight

Fingerprint

Dive into the research topics of 'Triseme decision trees in the continuous speech recognition system for a talking head'. Together they form a unique fingerprint.

Cite this