Abstract
This paper presents a novel approach to processing temporal lip motion information for dynamic visual feature extraction in visual speech recognition. The long-time Lip TempoRAl Patterns (LipTRAPs) of visual phonemes are introduced to analyze the nature of lip shape changes when uttering speech. A dynamic visual feature is also proposed based on the LipTRAPs. Visual speech recognition experiments on a connected-digits task show that the LipTRAP feature can yield significant WRR improvments than conventional delta features.
Original language | English |
---|---|
Pages | 703-706 |
Number of pages | 4 |
State | Published - 2004 |
Event | 2004 7th International Conference on Signal Processing Proceedings (ICSP'04) - Beijing, China Duration: 31 Aug 2004 → 4 Sep 2004 |
Conference
Conference | 2004 7th International Conference on Signal Processing Proceedings (ICSP'04) |
---|---|
Country/Territory | China |
City | Beijing |
Period | 31/08/04 → 4/09/04 |
Keywords
- Feature extraction
- Lip temporal pattern
- Lipreading
- Visual speech recognition