跳到主要导航 跳到搜索 跳到主要内容

Audio-visual human recognition using semi-supervised spectral learning and hidden Markov models

  • Wei Feng
  • , Lei Xie
  • , Jia Zeng
  • , Zhi Qiang Liu

科研成果: 期刊稿件文章同行评审

15 引用 (Scopus)

摘要

This paper presents a multimodal system for reliable human identity recognition under variant conditions. Our system fuses the recognition of face and speech with a general probabilistic framework. For face recognition, we propose a new spectral learning algorithm, which considers not only the discriminative relations among the training data but also the generative models for each class. Due to the tedious cost of face labeling in practice, our spectral face learning utilizes a semi-supervised strategy. That is, only a small number of labeled faces are used in our training step, and the labels are optimally propagated to other unlabeled training faces. Besides requiring much less labeled data, our algorithm also enables a natural way to explicitly train an outlier model that approximately represents unauthorized faces. To boost the robustness of our system for human recognition under various environments, our face recognition is further complemented by a speaker identification agent. Specifically, this agent models the statistical variations of fixed-phrase speech using speaker-dependent word hidden Markov models. Experiments on benchmark databases validate the effectiveness of our face recognition and speaker identification agents, and demonstrate that the recognition accuracy can be apparently improved by integrating these two independent biometric sources together.

源语言英语
页(从-至)188-195
页数8
期刊Journal of Visual Languages and Computing
20
3
DOI
出版状态已出版 - 6月 2009

指纹

探究 'Audio-visual human recognition using semi-supervised spectral learning and hidden Markov models' 的科研主题。它们共同构成独一无二的指纹。

引用此