Isolated word recognition in reverberant environments

Shu Guang Wang, Xiang Yang Zeng, Qiang Wang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

The additive noise and channel distortion caused by reverberation can degrade the performance of isolated word recognition(IWR), and have become the key constraint to the applications of IWR. In this paper, we present a reverberation robust isolated word recognition method. By using the relative autocorrelation sequences (RAS) based voice activity detection, influences of additive noise can be eliminated. To reduce the channel distortion Cepstral mean subtraction (CMS) is employed in Mel frequency cepstral coefficients (MFCC) extraction. And Gaussian mixture model (GMM) is used for the statistical modeling. The performance of the presented method in various reverberation conditions was evaluated by the recognition experiments.

Original languageEnglish
Title of host publication2011 IEEE International Conference on Signal Processing, Communications and Computing, ICSPCC 2011
DOIs
StatePublished - 2011
Event2011 IEEE International Conference on Signal Processing, Communications and Computing, ICSPCC 2011 - Xi'an, China
Duration: 14 Sep 201116 Sep 2011

Publication series

Name2011 IEEE International Conference on Signal Processing, Communications and Computing, ICSPCC 2011

Conference

Conference2011 IEEE International Conference on Signal Processing, Communications and Computing, ICSPCC 2011
Country/TerritoryChina
CityXi'an
Period14/09/1116/09/11

Keywords

  • cepstral mean subtraction
  • Gaussian mixture model
  • isolated word recognition
  • Mel frequency cepstral coefficients
  • relative autocorrelation sequences
  • reverberation

Fingerprint

Dive into the research topics of 'Isolated word recognition in reverberant environments'. Together they form a unique fingerprint.

Cite this