Abstract
This paper proposes to model broadcast news prosody using conditional random fields (CRF) for news story segmentation. Broadcast news has both editorial prosody and speech prosody that convey essential structural information for story segmentation. Hence we extract prosodic features, including pause duration, pitch, intensity, rapidity, speaker change and music, for a sequence of boundary candidates. A linearchain CRF is used to label each candidate with boundary/nonboundary tags based on the prosodic features. Important interlabel relations and contextual feature interactions are effectively captured by CRF's sequential learning framework. Experiments show that the CRF approach outperforms decision tree (DT), support vector machines (SVM) and maximum entropy (ME) classifiers in prosody-based story segmentation.
| Original language | English |
|---|---|
| Pages | 253-256 |
| Number of pages | 4 |
| State | Published - 2010 |
| Event | 2nd Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2010 - Biopolis, Singapore Duration: 14 Dec 2010 → 17 Dec 2010 |
Conference
| Conference | 2nd Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2010 |
|---|---|
| Country/Territory | Singapore |
| City | Biopolis |
| Period | 14/12/10 → 17/12/10 |