An Automatic Voice Conversion Evaluation Strategy Based on Perceptual Background Noise Distortion and Speaker Similarity

Dong Yan Huang, Lei Xie, Yvonne Siu Wa Lee, Jie Wu, Huaiping Ming, Xiaohai Tian, Shaofei Zhang, Chuang Ding, Mei Li, Quy Hy Nguyen, Minghui Dong, Eng Siong Chng, Haizhou Li

科研成果: 会议稿件论文同行评审

7 引用 (Scopus)

摘要

Voice conversion aims to modify the characteristics of one speaker to make it sound like spoken by another speaker without changing the language content. This task has attracted considerable attention and various approaches have been proposed since two decades ago. The evaluation of voice conversion approaches, usually through time-intensive subject listening tests, requires a huge amount of human labor. This paper proposes an automatic voice conversion evaluation strategy based on perceptual background noise distortion and speaker similarity. Experimental results show that our automatic evaluation results match the subjective listening results quite well. We further use our strategy to select best converted samples from multiple voice conversion systems and our submission achieves promising results in the voice conversion challenge (VCC2016).

源语言英语
44-51
页数8
出版状态已出版 - 2016
活动9th ISCA Speech Synthesis Workshop, SSW 2016 - Sunnyvale, 美国
期限: 13 9月 201615 9月 2016

会议

会议9th ISCA Speech Synthesis Workshop, SSW 2016
国家/地区美国
Sunnyvale
时期13/09/1615/09/16

指纹

探究 'An Automatic Voice Conversion Evaluation Strategy Based on Perceptual Background Noise Distortion and Speaker Similarity' 的科研主题。它们共同构成独一无二的指纹。

引用此