TY - JOUR
T1 - Voice conversion using bayesian analysis and dynamic kernel features
AU - Li, Na
AU - Zeng, Xiangyang
AU - Qiao, Yu
AU - Li, Zhifeng
N1 - Publisher Copyright:
©, 2015, Science Press. All right reserved.
PY - 2015/5/1
Y1 - 2015/5/1
N2 - When the training utterances are sparse, the voice conversion method based on Mixture of Probabilistic Linear Regressions is subjected to overfitting problem. To address that case, we adopt dynamic kernel features to replace the cepstrum features of the original speaker and estimate the transformation parameters in sense of Maximizing a Posterior with Bayesian inference. First, the features of the original speaker are converted into dynamic kernel features by kernel transformation. Then the prior information of the transformation parameters is introduced. Finally, according to different assumptions about conversion error, we propose two different methods to estimate the transformation parameters. Compared to MPLR, the proposed method achieves 4.25% relative decrease on the average cepstrum distortion in objective evaluations and obtains higher score about naturalness and similarity in subjective evaluations. Experimental results indicate that the proposed method can alleviate the overfitting problem.
AB - When the training utterances are sparse, the voice conversion method based on Mixture of Probabilistic Linear Regressions is subjected to overfitting problem. To address that case, we adopt dynamic kernel features to replace the cepstrum features of the original speaker and estimate the transformation parameters in sense of Maximizing a Posterior with Bayesian inference. First, the features of the original speaker are converted into dynamic kernel features by kernel transformation. Then the prior information of the transformation parameters is introduced. Finally, according to different assumptions about conversion error, we propose two different methods to estimate the transformation parameters. Compared to MPLR, the proposed method achieves 4.25% relative decrease on the average cepstrum distortion in objective evaluations and obtains higher score about naturalness and similarity in subjective evaluations. Experimental results indicate that the proposed method can alleviate the overfitting problem.
UR - http://www.scopus.com/inward/record.url?scp=84930069934&partnerID=8YFLogxK
M3 - 文章
AN - SCOPUS:84930069934
SN - 0371-0025
VL - 40
SP - 455
EP - 461
JO - Shengxue Xuebao/Acta Acustica
JF - Shengxue Xuebao/Acta Acustica
IS - 3
ER -