Head motion generation for speech-driven talking avatar

Bingfeng Li, Lei Xie, Pengcheng Zhu, Bo Fan

科研成果: 期刊稿件文章同行评审

3 引用 (Scopus)

摘要

This study describes methods for predicting head motion from acoustic speech. Current hidden Markov model (HMM)-based methods rely on definitions of typical head motion patterns and accurate recognition of these patterns. This study investigates the head motion prediction performance of various pattern definition strategies. The HMM method is less effective because the association between speech and the head gestures is essentially a nondeterministic, many-to-many mapping so the head motion pattern recognition accuracy is quite low. Therefore, this study treats the speech-to-head-motion mapping task as a regression problem. A back-propagation (BP) neutral network is used to seek a direct, continuous mapping from the acoustic speech to the head motion. Tests show that this neutral network approach significantly improves the head motion prediction accuracy and the naturalness of head movement of a talking avatar.

源语言英语
页(从-至)898-902
页数5
期刊Qinghua Daxue Xuebao/Journal of Tsinghua University
53
6
出版状态已出版 - 2013

指纹

探究 'Head motion generation for speech-driven talking avatar' 的科研主题。它们共同构成独一无二的指纹。

引用此