摘要
Although short-time Fourier Analysis based features such as LPCC and MFCC have been widely used in state-of-the-art speech recognizers, the short-time analysis technique suffers from the well-known trade-off between time and frequency resolution and works under the assumption that speech signal is short-time stationary. This paper investigates an approach to use Cohen's class bilinear time-frequency distributions representing speech signal for speech recognition. Preliminary experiments show that the new feature can better represent speech signal and can improve the accuracy of a speech recognizer.
源语言 | 英语 |
---|---|
页 | 674-677 |
页数 | 4 |
出版状态 | 已出版 - 1998 |
已对外发布 | 是 |
活动 | Proceedings of the 1998 4th International Conference on Signal Processing Proceedings, ICSP '98 - Beijing, China 期限: 12 10月 1998 → 16 10月 1998 |
会议
会议 | Proceedings of the 1998 4th International Conference on Signal Processing Proceedings, ICSP '98 |
---|---|
市 | Beijing, China |
时期 | 12/10/98 → 16/10/98 |