跳到主要导航 跳到搜索 跳到主要内容

Broadcast news story segmentation using conditional random fields and multimodal features

  • Xiaoxuan Wang
  • , Lei Xie
  • , Mimi Lu
  • , Bin Ma
  • , Eng Siong Chng
  • , Haizhou Li
  • Northwestern Polytechnical University Xian
  • Agency for Science, Technology and Research, Singapore
  • Nanyang Technological University

科研成果: 期刊稿件文章同行评审

17 引用 (Scopus)

摘要

In this paper, we propose integration of multimodal features using conditional random fields (CRFs) for the segmentation of broadcast news stories. We study story boundary cues from lexical, audio and video modalities, where lexical features consist of lexical similarity, chain strength and overall cohesiveness; acoustic features involve pause duration, pitch, speaker change and audio event type; and visual features contain shot boundaries, anchor faces and news title captions. These features are extracted in a sequence of boundary candidate positions in the broadcast news. A linear-chain CRF is used to detect each candidate as boundary/non-boundary tags based on the multimodal features. Important interlabel relations and contextual feature information are effectively captured by the sequential learning framework of CRFs. Story segmentation experiments show that the CRF approach outperforms other popular classifiers, including decision trees (DTs), Bayesian networks (BNs), naive Bayesian classifiers (NBs), multilayer perception (MLP), support vector machines (SVMs) and maximum entropy (ME) classifiers.

源语言英语
页(从-至)1206-1215
页数10
期刊IEICE Transactions on Information and Systems
E95-D
5
DOI
出版状态已出版 - 5月 2012

指纹

探究 'Broadcast news story segmentation using conditional random fields and multimodal features' 的科研主题。它们共同构成独一无二的指纹。

引用此