Triseme decision trees in the continuous speech recognition system for talking head animation

Xie Lei; Zhao Rongchun; Jiang Dongmei; Cravyse Ilse; Sahli Hichem; Conlenis Jan

School of Computer Science

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Peer-reviewed

Abstract

A viseme is an audio-visual model for speech-driven talking head animation. In this paper, a viseme HMM-based speech recognition system is built to drive a talking head. Trisemes are used to take mouth-shape contextual information into account and so achieve accurate models. Because the number of models then grows rapidly, decision-tree-based state tying is adopted in the triseme modeling to obtain robust models from the limited training data. Similarity of mouth shapes (SMS) is proposed as the basis for designing the visual question set used in tree building. Experimental results show that SMS is a good measure of mouth-shape context and that decision trees are a feasible way to obtain robust model parameter estimates.

Original language	English
Title of host publication	Proceedings of the International Conference on Active Media Technology
Editors	J.P. Li, J. Liu, N. Zhong, J. Yen, J. Zhao
Pages	389-395
Number of pages	7
Publication status	Published - 2003
Event	Proceedings of the Second International Conference on Active Media Technology - Chongqing, China. Duration: 29 May 2003 → 31 May 2003

Publication series

Name	Proceedings of the International Conference on Active Media Technology

Conference

Conference	Proceedings of the Second International Conference on Active Media Technology
Country/Territory	China
City	Chongqing
Period	29/05/03 → 31/05/03

Other files and links

Link to publication in Scopus

Cite this

Lei, X., Rongchun, Z., Dongmei, J., Ilse, C., Hichem, S., & Jan, C. (2003). Triseme decision trees in the continuous speech recognition system for talking head animation. In J. P. Li, J. Liu, N. Zhong, J. Yen, & J. Zhao (Eds.), Proceedings of the International Conference on Active Media Technology (pp. 389-395). (Proceedings of the International Conference on Active Media Technology).

Lei, Xie ; Rongchun, Zhao ; Dongmei, Jiang et al. / Triseme decision trees in the continuous speech recognition system for talking head animation. Proceedings of the International Conference on Active Media Technology. ed. / J.P. Li ; J. Liu ; N. Zhong ; J. Yen ; J. Zhao. 2003. pp. 389-395 (Proceedings of the International Conference on Active Media Technology).

@inproceedings{8006cde40b94447d800105ff35a84a9a,

title = "Triseme decision trees in the continuous speech recognition system for talking head animation",

abstract = "A viseme is an audio-visual model for speech-driven talking head animation. In this paper, a viseme HMM-based speech recognition system is built to drive a talking head. Trisemes are used to take mouth-shape contextual information into account and so achieve accurate models. Because the number of models then grows rapidly, decision-tree-based state tying is adopted in the triseme modeling to obtain robust models from the limited training data. Similarity of mouth shapes (SMS) is proposed as the basis for designing the visual question set used in tree building. Experimental results show that SMS is a good measure of mouth-shape context and that decision trees are a feasible way to obtain robust model parameter estimates.",

author = "Xie Lei and Zhao Rongchun and Jiang Dongmei and Cravyse Ilse and Sahli Hichem and Conlenis Jan",

year = "2003",

language = "English",

isbn = "9812383433",

series = "Proceedings of the International Conference on Active Media Technology",

pages = "389--395",

editor = "J.P. Li and J. Liu and N. Zhong and J. Yen and J. Zhao",

booktitle = "Proceedings of the International Conference on Active Media Technology",

note = "Proceedings of the Second International Conference on Active Media Technology ; Conference date: 29-05-2003 Through 31-05-2003",

}

Lei, X, Rongchun, Z, Dongmei, J, Ilse, C, Hichem, S & Jan, C 2003, Triseme decision trees in the continuous speech recognition system for talking head animation. in JP Li, J Liu, N Zhong, J Yen & J Zhao (eds), Proceedings of the International Conference on Active Media Technology. Proceedings of the International Conference on Active Media Technology, pp. 389-395, Proceedings of the Second International Conference on Active Media Technology, Chongqing, China, 29/05/03.

Triseme decision trees in the continuous speech recognition system for talking head animation. / Lei, Xie; Rongchun, Zhao; Dongmei, Jiang et al.
Proceedings of the International Conference on Active Media Technology. ed. / J.P. Li; J. Liu; N. Zhong; J. Yen; J. Zhao. 2003. pp. 389-395 (Proceedings of the International Conference on Active Media Technology).

TY - GEN

T1 - Triseme decision trees in the continuous speech recognition system for talking head animation

AU - Lei, Xie

AU - Rongchun, Zhao

AU - Dongmei, Jiang

AU - Ilse, Cravyse

AU - Hichem, Sahli

AU - Jan, Conlenis

PY - 2003

Y1 - 2003

N2 - A viseme is an audio-visual model for speech-driven talking head animation. In this paper, a viseme HMM-based speech recognition system is built to drive a talking head. Trisemes are used to take mouth-shape contextual information into account and so achieve accurate models. Because the number of models then grows rapidly, decision-tree-based state tying is adopted in the triseme modeling to obtain robust models from the limited training data. Similarity of mouth shapes (SMS) is proposed as the basis for designing the visual question set used in tree building. Experimental results show that SMS is a good measure of mouth-shape context and that decision trees are a feasible way to obtain robust model parameter estimates.

AB - A viseme is an audio-visual model for speech-driven talking head animation. In this paper, a viseme HMM-based speech recognition system is built to drive a talking head. Trisemes are used to take mouth-shape contextual information into account and so achieve accurate models. Because the number of models then grows rapidly, decision-tree-based state tying is adopted in the triseme modeling to obtain robust models from the limited training data. Similarity of mouth shapes (SMS) is proposed as the basis for designing the visual question set used in tree building. Experimental results show that SMS is a good measure of mouth-shape context and that decision trees are a feasible way to obtain robust model parameter estimates.

UR - http://www.scopus.com/inward/record.url?scp=0141929608&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:0141929608

SN - 9812383433

T3 - Proceedings of the International Conference on Active Media Technology

SP - 389

EP - 395

BT - Proceedings of the International Conference on Active Media Technology

A2 - Li, J.P.

A2 - Liu, J.

A2 - Zhong, N.

A2 - Yen, J.

A2 - Zhao, J.

T2 - Proceedings of the Second International Conference on Active Media Technology

Y2 - 29 May 2003 through 31 May 2003

ER -

Lei X, Rongchun Z, Dongmei J, Ilse C, Hichem S, Jan C. Triseme decision trees in the continuous speech recognition system for talking head animation. In Li JP, Liu J, Zhong N, Yen J, Zhao J, editors, Proceedings of the International Conference on Active Media Technology. 2003. p. 389-395. (Proceedings of the International Conference on Active Media Technology).
