A Bidirectional LSTM Approach with Word Embeddings for Sentence Boundary Detection

Chenglin Xu, Lei Xie, Xiong Xiao

科研成果: 期刊稿件文章同行评审

11 引用 (Scopus)

摘要

Recovering sentence boundaries from speech and its transcripts is essential for readability and downstream speech and language processing tasks. In this paper, we propose to use deep recurrent neural network to detect sentence boundaries in broadcast news by modeling rich prosodic and lexical features extracted at each inter-word position. We introduce an unsupervised word embedding to represent word identity, learned from the Continuous Bag-of-Words (CBOW) model, into sentence boundary detection task as an effective feature. The word embedding contains syntactic information that is essential for this detection task. In addition, we propose another two low-dimensional word embeddings derived from a neural network that includes class and context information to represent words by supervised learning: one is extracted from the projection layer, the other one comes from the last hidden layer. Furthermore, we propose a deep bidirectional Long Short Term Memory (LSTM) based architecture with Viterbi decoding for sentence boundary detection. Under this framework, the long-range dependencies of prosodic and lexical information in temporal sequences are modeled effectively. Compared with previous state-of-the-art DNN-CRF method, the proposed LSTM approach reduces 24.8% and 9.8% relative NIST SU error in reference and recognition transcripts, respectively.

源语言英语
页(从-至)1063-1075
页数13
期刊Journal of Signal Processing Systems
90
7
DOI
出版状态已出版 - 1 7月 2018

指纹

探究 'A Bidirectional LSTM Approach with Word Embeddings for Sentence Boundary Detection' 的科研主题。它们共同构成独一无二的指纹。

引用此