Abstract
Although short-time Fourier Analysis based features such as LPCC and MFCC have been widely used in state-of-the-art speech recognizers, the short-time analysis technique suffers from the well-known trade-off between time and frequency resolution and works under the assumption that speech signal is short-time stationary. This paper investigates an approach to use Cohen's class bilinear time-frequency distributions representing speech signal for speech recognition. Preliminary experiments show that the new feature can better represent speech signal and can improve the accuracy of a speech recognizer.
Original language | English |
---|---|
Pages | 674-677 |
Number of pages | 4 |
State | Published - 1998 |
Externally published | Yes |
Event | Proceedings of the 1998 4th International Conference on Signal Processing Proceedings, ICSP '98 - Beijing, China Duration: 12 Oct 1998 → 16 Oct 1998 |
Conference
Conference | Proceedings of the 1998 4th International Conference on Signal Processing Proceedings, ICSP '98 |
---|---|
City | Beijing, China |
Period | 12/10/98 → 16/10/98 |