Fundamental frequency modeling using wavelets for emotional voice conversion

Huaiping Ming, Dongyan Huang, Minghui Dong, Haizhou Li, Lei Xie, Shaofei Zhang

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

33 Citations (Scopus)

Abstract

This paper presents a representation of fundamental frequency (F0) based on the continuous wavelet transform (CWT) for prosody modeling in emotion conversion. Emotion conversion aims to convert speech from one emotional state to another. Specifically, we use the CWT to decompose F0 into a five-scale representation, with each scale corresponding to a different temporal scale. A neutral voice is converted to an emotional voice under an exemplar-based voice conversion framework, in which spectrum and F0 are converted simultaneously. The simulation results demonstrate that the dynamics of F0 at different temporal scales can be well captured and converted using the five-scale CWT representation. The converted speech is evaluated both objectively and subjectively, and both evaluations confirm the effectiveness of the proposed method.
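To make the five-scale decomposition concrete, the following is a minimal sketch and not the authors' implementation. It assumes a Mexican hat mother wavelet, a 5 ms frame shift, five illustrative scales spaced one octave apart starting at 20 ms, and a log-F0 contour that has already been interpolated over unvoiced frames; none of these choices are stated in the abstract. The names `mexican_hat` and `cwt_f0` are hypothetical.

```python
import numpy as np

def mexican_hat(t):
    """Mexican hat (Ricker) mother wavelet evaluated at dimensionless time t."""
    norm = 2.0 / (np.sqrt(3.0) * np.pi ** 0.25)
    return norm * (1.0 - t ** 2) * np.exp(-0.5 * t ** 2)

def cwt_f0(log_f0, scales, frame_shift=0.005):
    """Decompose a continuous log-F0 contour into one component per scale.

    log_f0      : 1-D array, one value per frame (unvoiced gaps already filled)
    scales      : wavelet scales in seconds (assumed values, see lead-in)
    frame_shift : frame period in seconds (assumed 5 ms)
    Returns an array of shape (len(scales), len(log_f0)).
    """
    # Normalise to zero mean and unit variance before the transform.
    x = (log_f0 - log_f0.mean()) / log_f0.std()
    out = np.empty((len(scales), len(x)))
    for i, s in enumerate(scales):
        # Sample the wavelet over roughly +/- 5 scale widths.
        half = int(np.ceil(5.0 * s / frame_shift))
        t = np.arange(-half, half + 1) * frame_shift / s
        kernel = mexican_hat(t) * frame_shift / np.sqrt(s)
        # Edge-pad so the "valid" convolution returns exactly len(x) samples.
        padded = np.pad(x, half, mode="edge")
        out[i] = np.convolve(padded, kernel, mode="valid")
    return out

# Synthetic 2-second contour: a slow phrase-level movement plus a faster one.
time = np.arange(400) * 0.005
log_f0 = (np.log(120.0)
          + 0.20 * np.sin(2 * np.pi * 0.5 * time)
          + 0.05 * np.sin(2 * np.pi * 4.0 * time))

scales = 0.02 * 2.0 ** np.arange(5)   # 20 ms to 320 ms, illustrative only
components = cwt_f0(log_f0, scales)
print(components.shape)
```

Each row of `components` tracks F0 movement at one temporal scale, so a conversion model could treat the rows as separate prosodic features alongside the spectrum. How the scales are chosen and how the contour is reconstructed after conversion are left open here.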

Original language: English
Title of host publication: 2015 International Conference on Affective Computing and Intelligent Interaction, ACII 2015
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 804-809
Number of pages: 6
ISBN (Electronic): 9781479999538
DOI
Publication status: Published - 2 Dec 2015
Event: 2015 International Conference on Affective Computing and Intelligent Interaction, ACII 2015 - Xi'an, China
Duration: 21 Sep 2015 to 24 Sep 2015

Publication series

Name: 2015 International Conference on Affective Computing and Intelligent Interaction, ACII 2015

Conference

Conference: 2015 International Conference on Affective Computing and Intelligent Interaction, ACII 2015
Country/Territory: China
City: Xi'an
Period: 21/09/15 to 24/09/15
