Exemplar-based sparse representation of timbre and prosody for voice conversion

Huaiping Ming, Dongyan Huang, Lei Xie, Shaofei Zhang, Minghui Dong, Haizhou Li

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-reviewed

35 Citations (Scopus)

Abstract

Voice conversion (VC) aims to make the speech of one speaker (source) sound as if it were spoken by another speaker (target) without changing the linguistic content. Most state-of-the-art voice conversion systems focus only on timbre conversion. However, speaker identity is also characterized by source-related cues such as fundamental frequency and energy. In this work, we propose an exemplar-based sparse representation of timbre and prosody for voice conversion that does not require separate timbre and prosody conversions. The experimental results show that, in addition to the conversion of spectral features, proper conversion of prosody features improves the quality and speaker identity of the converted speech.
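The general idea behind exemplar-based sparse representation can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the feature layout, the solver, or any parameter values. The sketch assumes paired source and target dictionaries whose columns are time-aligned exemplars (each column hypothetically stacking nonnegative spectral and prosody values), and it swaps in a standard multiplicative-update solver for nonnegative sparse coding. The names `sparse_activations`, `convert`, `A_src`, `B_tgt`, and `sparsity` are illustrative only.

```python
import numpy as np


def sparse_activations(A, X, sparsity=0.1, n_iter=200, eps=1e-12):
    """Estimate nonnegative activations H such that X is approximately A @ H.

    Uses multiplicative updates that minimise
    0.5 * ||X - A @ H||^2 + sparsity * sum(H) subject to H >= 0.
    A (features x exemplars) and X (features x frames) must be nonnegative.
    """
    rng = np.random.default_rng(0)  # fixed seed so the sketch is repeatable
    H = rng.random((A.shape[1], X.shape[1])) + eps
    AtX = A.T @ X
    AtA = A.T @ A
    for _ in range(n_iter):
        # The L1 penalty appears as a constant added to the denominator.
        H *= AtX / (AtA @ H + sparsity + eps)
    return H


def convert(A_src, B_tgt, X_src, **kwargs):
    """Convert source frames by reusing their activations on target exemplars.

    Assumption: column k of A_src and column k of B_tgt are aligned exemplars
    of the same content from the source and target speaker respectively.
    """
    H = sparse_activations(A_src, X_src, **kwargs)
    return B_tgt @ H
```

Because the same activation matrix weights every row of the target dictionary, spectral and prosody rows stacked in one exemplar are converted in a single step under this assumed layout, which is one way to read the abstract's point about not needing separate conversions.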

Original language: English
Title of host publication: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016 - Proceedings
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 5175-5179
Number of pages: 5
ISBN (Electronic): 9781479999880
DOI
Publication status: Published - 18 May 2016
Event: 41st IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016 - Shanghai, China
Duration: 20 Mar 2016 to 25 Mar 2016

Publication series

Name: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume: 2016-May
ISSN (Print): 1520-6149

Conference

Conference: 41st IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016
Country/Territory: China
City: Shanghai
Period: 20/03/16 to 25/03/16
