Modeling broadcast news prosody using conditional random fields for story segmentation

Xiaoxuan Wang, Lei Xie, Bin Ma, Eng Siong Chng, Haizhou Li

Research output: Contribution to conferencePaperpeer-review

11 Scopus citations

Abstract

This paper proposes to model broadcast news prosody using conditional random fields (CRF) for news story segmentation. Broadcast news has both editorial prosody and speech prosody that convey essential structural information for story segmentation. Hence we extract prosodic features, including pause duration, pitch, intensity, rapidity, speaker change and music, for a sequence of boundary candidates. A linearchain CRF is used to label each candidate with boundary/nonboundary tags based on the prosodic features. Important interlabel relations and contextual feature interactions are effectively captured by CRF's sequential learning framework. Experiments show that the CRF approach outperforms decision tree (DT), support vector machines (SVM) and maximum entropy (ME) classifiers in prosody-based story segmentation.

Original languageEnglish
Pages253-256
Number of pages4
StatePublished - 2010
Event2nd Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2010 - Biopolis, Singapore
Duration: 14 Dec 201017 Dec 2010

Conference

Conference2nd Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2010
Country/TerritorySingapore
CityBiopolis
Period14/12/1017/12/10

Fingerprint

Dive into the research topics of 'Modeling broadcast news prosody using conditional random fields for story segmentation'. Together they form a unique fingerprint.

Cite this