Abstract
Spectral subband centroid, which is essentially the first-order normalized moment, has been proposed for speech recognition and its robustness to additive noise has been demonstrated before. In this paper, we extend this concept to the use of normalized spectral subband moments (NSSM) for robust speech recognition. We show that normalized moments, if properly selected, yield comparable recognition performance as the cepstral coefficients in clean speech, while deliver a better performance than the cepstra in noisy environments. We also propose a procedure to construct the dynamic moments that essentially embodies the transitional spectral information. We discuss some properties of the proposed dynamic features.
Original language | English |
---|---|
Pages | 2441-2444 |
Number of pages | 4 |
State | Published - 2002 |
Externally published | Yes |
Event | 7th International Conference on Spoken Language Processing, ICSLP 2002 - Denver, United States Duration: 16 Sep 2002 → 20 Sep 2002 |
Conference
Conference | 7th International Conference on Spoken Language Processing, ICSLP 2002 |
---|---|
Country/Territory | United States |
City | Denver |
Period | 16/09/02 → 20/09/02 |