Recognition of noisy speech using dynamic spectral subband centroids

Jingdong Chen; Yiteng Arden Huang; Qi Li; Kuldip K. Paliwal

doi:10.1109/LSP.2003.821689

Recognition of noisy speech using dynamic spectral subband centroids

Jingdong Chen, Yiteng Arden Huang, Qi Li, Kuldip K. Paliwal

Research output: Contribution to journal › Article › peer-review

47 Scopus citations

Abstract

Despite their widespread popularity as front-end parameters for speech recognition, the cepstral coefficients derived from either linear prediction analysis or a filter-bank are found to be sensitive to additive noise. In this letter, we discuss the use of spectral subband centroids for robust speech recognition. We show that centroids, if properly selected, can achieve recognition performance comparable to that of the mel-frequency cepstral coefficients (MFCCs) in clean speech, while delivering better performance than MFCC in noisy environments. A procedure is proposed to construct the dynamic centroid feature vector that essentially embodies the transitional spectral information. We discuss some properties of the proposed dynamic features.

Original language	English
Pages (from-to)	258-261
Number of pages	4
Journal	IEEE Signal Processing Letters
Volume	11
Issue number	2 PART II
DOIs	https://doi.org/10.1109/LSP.2003.821689
State	Published - Feb 2004
Externally published	Yes

Keywords

Cepstrum
Robust speech recognition
Subband centroid

Access to Document

10.1109/LSP.2003.821689

Cite this

@article{492f29074ac845fdba60f2ae403b06c1,

title = "Recognition of noisy speech using dynamic spectral subband centroids",

abstract = "Despite their widespread popularity as front-end parameters for speech recognition, the cepstral coefficients derived from either linear prediction analysis or a filter-bank are found to be sensitive to additive noise. In this letter, we discuss the use of spectral subband centroids for robust speech recognition. We show that centroids, if properly selected, can achieve recognition performance comparable to that of the mel-frequency cepstral coefficients (MFCCs) in clean speech, while delivering better performance than MFCC in noisy environments. A procedure is proposed to construct the dynamic centroid feature vector that essentially embodies the transitional spectral information. We discuss some properties of the proposed dynamic features.",

keywords = "Cepstrum, Robust speech recognition, Subband centroid",

author = "Jingdong Chen and Huang, {Yiteng Arden} and Qi Li and Paliwal, {Kuldip K.}",

year = "2004",

month = feb,

doi = "10.1109/LSP.2003.821689",

language = "英语",

volume = "11",

pages = "258--261",

journal = "IEEE Signal Processing Letters",

issn = "1070-9908",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "2 PART II",

}

TY - JOUR

T1 - Recognition of noisy speech using dynamic spectral subband centroids

AU - Chen, Jingdong

AU - Huang, Yiteng Arden

AU - Li, Qi

AU - Paliwal, Kuldip K.

PY - 2004/2

Y1 - 2004/2

N2 - Despite their widespread popularity as front-end parameters for speech recognition, the cepstral coefficients derived from either linear prediction analysis or a filter-bank are found to be sensitive to additive noise. In this letter, we discuss the use of spectral subband centroids for robust speech recognition. We show that centroids, if properly selected, can achieve recognition performance comparable to that of the mel-frequency cepstral coefficients (MFCCs) in clean speech, while delivering better performance than MFCC in noisy environments. A procedure is proposed to construct the dynamic centroid feature vector that essentially embodies the transitional spectral information. We discuss some properties of the proposed dynamic features.

AB - Despite their widespread popularity as front-end parameters for speech recognition, the cepstral coefficients derived from either linear prediction analysis or a filter-bank are found to be sensitive to additive noise. In this letter, we discuss the use of spectral subband centroids for robust speech recognition. We show that centroids, if properly selected, can achieve recognition performance comparable to that of the mel-frequency cepstral coefficients (MFCCs) in clean speech, while delivering better performance than MFCC in noisy environments. A procedure is proposed to construct the dynamic centroid feature vector that essentially embodies the transitional spectral information. We discuss some properties of the proposed dynamic features.

KW - Cepstrum

KW - Robust speech recognition

KW - Subband centroid

UR - http://www.scopus.com/inward/record.url?scp=0442326756&partnerID=8YFLogxK

U2 - 10.1109/LSP.2003.821689

DO - 10.1109/LSP.2003.821689

M3 - 文章

AN - SCOPUS:0442326756

SN - 1070-9908

VL - 11

SP - 258

EP - 261

JO - IEEE Signal Processing Letters

JF - IEEE Signal Processing Letters

IS - 2 PART II

ER -

Recognition of noisy speech using dynamic spectral subband centroids

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this