A robust hierarchical lip tracking approach for lipreading and audio visual speech recognition

  • Lei Xie
  • Xiu Li Cai
  • Zhong Hua Fu
  • Rong Chun Zhao
  • Dong Mei Jiang

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

13 Citations (Scopus)

Abstract

This paper presents a robust hierarchical lip tracking approach (RoHiLTA) for lipreading and audio visual speech recognition (AVSR) applications. Lip regions of interest are detected using motion and facial structure information. Active Shape Models (ASMs) are improved to extract lip contours more accurately and efficiently from video sequences of a speaker's talking face under natural lighting conditions and without special makeup. Both the local and global ASM search algorithms are improved by introducing color information, 2D mouth corner matching, and robust estimation. To obtain noise-free features, localization errors are automatically corrected by an interpolation scheme. A fast implementation of the hierarchical approach is also proposed. Extensive experiments show that the improved ASM effectively reduces lip locating errors. The fast implementation of RoHiLTA consistently outperforms conventional ASMs in lip tracking tasks and can therefore be effectively integrated into lipreading and AVSR systems.

Original language: English
Title of host publication: Proceedings of 2004 International Conference on Machine Learning and Cybernetics
Pages: 3620-3624
Number of pages: 5
Publication status: Published - 2004
Event: Proceedings of 2004 International Conference on Machine Learning and Cybernetics - Shanghai, China
Duration: 26 Aug 2004 to 29 Aug 2004

Publication series

Name: Proceedings of 2004 International Conference on Machine Learning and Cybernetics
Volume: 6

Conference

Conference: Proceedings of 2004 International Conference on Machine Learning and Cybernetics
Country/Territory: China
City: Shanghai
Period: 26/08/04 to 29/08/04
