RC-MES: A novel speaker modeling technique based on regression class for speaker identification

Zhong Hua Fu, Lei Xie, Rong Chun Zhao

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Speaker modeling technique is an essential problem to robust speaker recognition, especially when enrolment data is sparse. This paper presents a novel modeling approach named Multi-EigenSpace modeling technique based on Regression Class (RC-MES), which integrates the common eigenspace technique and the regression class (RC) idea of Maximum Likelihood Linear Regression (MLLR). RC-MES not only solves the problem of prior knowledge limitation of Gaussian Mixture Models (GMM) but also remedies the shortcoming of common eigenspace that confuses speaker differences and phoneme differences. The eigenvoice analysis in RC can provide better discrimination ability between different speakers. The experimental results on speaker identification of 75 males show that, when enrolment data is sparse, RC-MES provides significant improvement over GMM, and the number of eigenvoices in RC-MES is fewer than that in common eigenspace.

Original languageEnglish
Title of host publication2004 International Symposium on Intelligent Multimedia, Video and Speech Processing, ISIMP 2004
Pages214-217
Number of pages4
StatePublished - 2004
Event2004 International Symposium on Intelligent Multimedia, Video and Speech Processing, ISIMP 2004 - Hong Kong, China, Hong Kong
Duration: 20 Oct 200422 Oct 2004

Publication series

Name2004 International Symposium on Intelligent Multimedia, Video and Speech Processing, ISIMP 2004

Conference

Conference2004 International Symposium on Intelligent Multimedia, Video and Speech Processing, ISIMP 2004
Country/TerritoryHong Kong
CityHong Kong, China
Period20/10/0422/10/04

Fingerprint

Dive into the research topics of 'RC-MES: A novel speaker modeling technique based on regression class for speaker identification'. Together they form a unique fingerprint.

Cite this