A novel robust feature of speech signal based on the Mellin transform for speaker-independent speech recognition

Jingdong Chen; Bo Xu; Taiyi Huang

doi:10.1109/ICASSP.1998.675343

A novel robust feature of speech signal based on the Mellin transform for speaker-independent speech recognition

Jingdong Chen, Bo Xu, Taiyi Huang

CAS - Institute of Automation

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

9 Scopus citations

Abstract

This paper presents a novel kind of speech feature which is the modified Mellin transform of the log-spectrum of the speech signal (short for MMTLS). Because of the scale invariance property of the modified Mellin transform, the new feature is insensitive to the variation of the vocal tract length among individual speakers, and thus it is more appropriate for speaker-independent speech recognition than the popular used cepstrum. The preliminary experiments show that the performance of the MMTLS-based method is much better in comparison with those of the LPC- and MFC-based methods. Moreover, the error rate of this method is very consistent for different outlier speakers.

Original language	English
Title of host publication	Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1998
Pages	629-632
Number of pages	4
DOIs	https://doi.org/10.1109/ICASSP.1998.675343
State	Published - 1998
Externally published	Yes
Event	1998 23rd IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1998 - Seattle, WA, United States Duration: 12 May 1998 → 15 May 1998

Publication series

Name	ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume	2
ISSN (Print)	1520-6149

Conference

Conference	1998 23rd IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1998
Country/Territory	United States
City	Seattle, WA
Period	12/05/98 → 15/05/98

Access to Document

10.1109/ICASSP.1998.675343

Cite this

Chen, J., Xu, B., & Huang, T. (1998). A novel robust feature of speech signal based on the Mellin transform for speaker-independent speech recognition. In Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1998 (pp. 629-632). Article 675343 (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; Vol. 2). https://doi.org/10.1109/ICASSP.1998.675343

Chen, Jingdong ; Xu, Bo ; Huang, Taiyi. / A novel robust feature of speech signal based on the Mellin transform for speaker-independent speech recognition. Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1998. 1998. pp. 629-632 (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings).

@inproceedings{e26a1e7f49e14942a864ee7bb97993d7,

title = "A novel robust feature of speech signal based on the Mellin transform for speaker-independent speech recognition",

abstract = "This paper presents a novel kind of speech feature which is the modified Mellin transform of the log-spectrum of the speech signal (short for MMTLS). Because of the scale invariance property of the modified Mellin transform, the new feature is insensitive to the variation of the vocal tract length among individual speakers, and thus it is more appropriate for speaker-independent speech recognition than the popular used cepstrum. The preliminary experiments show that the performance of the MMTLS-based method is much better in comparison with those of the LPC- and MFC-based methods. Moreover, the error rate of this method is very consistent for different outlier speakers.",

author = "Jingdong Chen and Bo Xu and Taiyi Huang",

year = "1998",

doi = "10.1109/ICASSP.1998.675343",

language = "英语",

isbn = "0780344286",

series = "ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings",

pages = "629--632",

booktitle = "Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1998",

note = "1998 23rd IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1998 ; Conference date: 12-05-1998 Through 15-05-1998",

}

Chen, J, Xu, B & Huang, T 1998, A novel robust feature of speech signal based on the Mellin transform for speaker-independent speech recognition. in Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1998., 675343, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, vol. 2, pp. 629-632, 1998 23rd IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1998, Seattle, WA, United States, 12/05/98. https://doi.org/10.1109/ICASSP.1998.675343

A novel robust feature of speech signal based on the Mellin transform for speaker-independent speech recognition. / Chen, Jingdong; Xu, Bo; Huang, Taiyi.
Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1998. 1998. p. 629-632 675343 (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; Vol. 2).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - A novel robust feature of speech signal based on the Mellin transform for speaker-independent speech recognition

AU - Chen, Jingdong

AU - Xu, Bo

AU - Huang, Taiyi

PY - 1998

Y1 - 1998

N2 - This paper presents a novel kind of speech feature which is the modified Mellin transform of the log-spectrum of the speech signal (short for MMTLS). Because of the scale invariance property of the modified Mellin transform, the new feature is insensitive to the variation of the vocal tract length among individual speakers, and thus it is more appropriate for speaker-independent speech recognition than the popular used cepstrum. The preliminary experiments show that the performance of the MMTLS-based method is much better in comparison with those of the LPC- and MFC-based methods. Moreover, the error rate of this method is very consistent for different outlier speakers.

AB - This paper presents a novel kind of speech feature which is the modified Mellin transform of the log-spectrum of the speech signal (short for MMTLS). Because of the scale invariance property of the modified Mellin transform, the new feature is insensitive to the variation of the vocal tract length among individual speakers, and thus it is more appropriate for speaker-independent speech recognition than the popular used cepstrum. The preliminary experiments show that the performance of the MMTLS-based method is much better in comparison with those of the LPC- and MFC-based methods. Moreover, the error rate of this method is very consistent for different outlier speakers.

UR - http://www.scopus.com/inward/record.url?scp=0011498037&partnerID=8YFLogxK

U2 - 10.1109/ICASSP.1998.675343

DO - 10.1109/ICASSP.1998.675343

M3 - 会议稿件

AN - SCOPUS:0011498037

SN - 0780344286

SN - 9780780344280

T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

SP - 629

EP - 632

BT - Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1998

T2 - 1998 23rd IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1998

Y2 - 12 May 1998 through 15 May 1998

ER -

Chen J, Xu B, Huang T. A novel robust feature of speech signal based on the Mellin transform for speaker-independent speech recognition. In Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1998. 1998. p. 629-632. 675343. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings). doi: 10.1109/ICASSP.1998.675343

A novel robust feature of speech signal based on the Mellin transform for speaker-independent speech recognition

Abstract

Publication series

Conference

Access to Document

Other files and links

Fingerprint

Cite this