A new speech feature insensitive to the variation of different speakers

Jingdong Chen, Bo Xu, Taiyi Huang

Research output: Contribution to journal › Article › peer-review

Abstract

This paper proposes a novel robust speech feature based on the modified Mellin transform. Because the modified Mellin transform is scale invariant, the new feature is insensitive to variation in vocal tract length among individual speakers, which makes it better suited to speaker-independent speech recognition than the widely used mel-scale frequency cepstral coefficients (MFCC). Experiments show that, compared with MFCC, the new feature not only improves the performance of a speaker-independent speech recognizer but also greatly reduces the standard deviation of the error rates across different outlier speakers.
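
The scale-invariance argument in the abstract can be illustrated with a small numerical sketch. The abstract does not define the authors' modified transform, so the code below uses the classical Mellin transform evaluated on the imaginary axis, M(jc) = ∫ f(t) t^(jc-1) dt, as a stand-in. The names `mellin_on_imag_axis`, `envelope`, and `warped_envelope`, the log-Gaussian toy envelope, and the warp factor 1.2 are all assumptions made for illustration and do not come from the paper. Substituting t = exp(u) turns a frequency scaling f(a·t) into a shift along u, which changes only the phase of the transform and leaves its magnitude unchanged.

```python
import cmath
import math


def mellin_on_imag_axis(f, c, u_min=-12.0, u_max=12.0, n=4801):
    """Approximate M(jc) = integral_0^inf f(t) * t**(jc - 1) dt.

    With t = exp(u) this becomes integral f(exp(u)) * exp(j*c*u) du,
    evaluated here with the trapezoidal rule on [u_min, u_max].
    """
    du = (u_max - u_min) / (n - 1)
    total = 0j
    for k in range(n):
        u = u_min + k * du
        weight = 0.5 if k in (0, n - 1) else 1.0  # trapezoid end points
        total += weight * f(math.exp(u)) * cmath.exp(1j * c * u)
    return total * du


def envelope(t):
    # Toy spectral envelope (an assumption): a Gaussian in log-frequency.
    return math.exp(-0.5 * math.log(t) ** 2)


def warped_envelope(t, a=1.2):
    # The same envelope with its frequency axis scaled by a hypothetical
    # factor a, mimicking a different vocal tract length.
    return envelope(a * t)


if __name__ == "__main__":
    for c in (0.0, 0.5, 1.0, 2.0):
        m_ref = abs(mellin_on_imag_axis(envelope, c))
        m_warp = abs(mellin_on_imag_axis(warped_envelope, c))
        print(f"c={c:.1f}  |M|={m_ref:.6f}  |M_warped|={m_warp:.6f}")
```

The two magnitude columns should agree to within numerical error even though the envelopes themselves differ at every frequency. A mel-warped cepstrum has no such built-in invariance, which is the motivation the abstract gives for the new feature.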

Original language: English
Pages (from-to): 70-72
Number of pages: 3
Journal: Chinese Journal of Electronics
Volume: 8
Issue number: 1
State: Published - 1999
Externally published: Yes

Keywords

  • Mellin transform
  • Speech recognition
