Skip to main navigation Skip to search Skip to main content

A deep neural network approach for sentence boundary detection in broadcast news

  • Chenglin Xu
  • , Lei Xie
  • , Guangpu Huang
  • , Xiong Xiao
  • , Eng Siong Chng
  • , Haizhou Li
  • Northwestern Polytechnical University Xian
  • Nanyang Technological University
  • Agency for Science, Technology and Research, Singapore

Research output: Contribution to journalConference articlepeer-review

38 Scopus citations

Abstract

This paper presents a deep neural network (DNN) approach to sentence boundary detection in broadcast news. We extract prosodic and lexical features at each inter-word position in the transcripts and learn a sequential classifier to label these positions as either boundary or non-boundary. This work is realized by a hybrid DNN-CRF (conditional random field) architecture. The DNN accepts prosodic feature inputs and non-linearly maps them into boundary/non-boundary posterior probability outputs. Subsequently, the posterior probabilities are combined with lexical features and the integrated features are modeled by a linear-chain CRF. The CRF finally labels the inter-word positions as boundary or non-boundary by Viterbi decoding. Experiments show that, as compared with the state-of-the-art DTCRF approach [1], the proposed DNN-CRF approach achieves 16.7% and 4.1% reduction in NIST boundary detection error in reference and speech recognition transcripts, respectively.

Keywords

  • Deep neural network
  • Rich transcription
  • Sentence boundary detection
  • Structural event detection

Fingerprint

Dive into the research topics of 'A deep neural network approach for sentence boundary detection in broadcast news'. Together they form a unique fingerprint.

Cite this