Abstract
This paper presents a novel approach to processing temporal lip motion information for dynamic visual feature extraction in visual speech recognition. Long-time Lip TempoRAl Patterns (LipTRAPs) of visual phonemes are introduced to analyze how lip shape changes during speech. A dynamic visual feature based on the LipTRAPs is also proposed. Visual speech recognition experiments on a connected-digits task show that the LipTRAP feature yields significant word recognition rate (WRR) improvements over conventional delta features.
Original language | English |
---|---|
Pages | 703-706 |
Number of pages | 4 |
Publication status | Published - 2004 |
Event | 2004 7th International Conference on Signal Processing Proceedings (ICSP'04) - Beijing, China. Duration: 31 Aug 2004 → 4 Sep 2004 |
Conference
Conference | 2004 7th International Conference on Signal Processing Proceedings (ICSP'04) |
---|---|
Country/Territory | China |
City | Beijing |
Period | 31/08/04 → 4/09/04 |