TY - GEN
T1 - An end-to-end neural network approach to story segmentation
AU - Yu, Jia
AU - Xie, Lei
AU - Xiao, Xiong
AU - Chng, Eng Siong
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/7/2
Y1 - 2017/7/2
N2 - This paper proposes an end-to-end story segmentation approach based on long short-term memory (LSTM) - recurrent neural network (RNN). Traditional story segmentation approaches are a two-stage pipeline consisting of feature extraction and segmentation, each of which has its individual objective function. In other words, the objective function used to extract features is different from the true performance measure of story segmentation, which may degrade the segmentation results. In this paper, we combine the two components and optimize them jointly, using an LSTM-RNN. Specifically, one LSTM layer is used to extract sentence vectors, and another LSTM layer is used to predict story boundaries by taking as input of the sentence vectors. Importantly, the whole network is optimized directly to predict story boundaries. We also investigate bi-directional LSTM (BLSTM) that can utilize past and future information in the process of extracting sentence vectors and story boundary prediction. Experimental results on the TDT2 corpus show that the proposed approach achieves state-of-the-art performance in story segmentation.
AB - This paper proposes an end-to-end story segmentation approach based on long short-term memory (LSTM) - recurrent neural network (RNN). Traditional story segmentation approaches are a two-stage pipeline consisting of feature extraction and segmentation, each of which has its individual objective function. In other words, the objective function used to extract features is different from the true performance measure of story segmentation, which may degrade the segmentation results. In this paper, we combine the two components and optimize them jointly, using an LSTM-RNN. Specifically, one LSTM layer is used to extract sentence vectors, and another LSTM layer is used to predict story boundaries by taking as input of the sentence vectors. Importantly, the whole network is optimized directly to predict story boundaries. We also investigate bi-directional LSTM (BLSTM) that can utilize past and future information in the process of extracting sentence vectors and story boundary prediction. Experimental results on the TDT2 corpus show that the proposed approach achieves state-of-the-art performance in story segmentation.
UR - http://www.scopus.com/inward/record.url?scp=85050380414&partnerID=8YFLogxK
U2 - 10.1109/APSIPA.2017.8282023
DO - 10.1109/APSIPA.2017.8282023
M3 - 会议稿件
AN - SCOPUS:85050380414
T3 - Proceedings - 9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017
SP - 171
EP - 176
BT - Proceedings - 9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017
Y2 - 12 December 2017 through 15 December 2017
ER -