Topic embedding of sentences for story segmentation

Jia Yu, Xiong Xiao, Lei Xie, Eng Siong Chng

科研成果: 书/报告/会议事项章节会议稿件同行评审

1 引用 (Scopus)

摘要

In this paper, we propose to embed sentences into fixed-dimensional vectors that carry the topic information for story segmentation. As a sentence comprises of a sequence of words and may have different lengths, we use long short-term memory recurrent neural network (LSTM-RNN) to summarize the information of the whole sentence and only predict the topic class at the last word in the sentence. The output of the network at the last word can be used as an embedding of the sentence in the topic space. We used the obtained sentence embeddings in the HMM-based story segmentation framework and obtained promising results. On the TDT2 corpus, the F1 measure is improved to 0.789 from 0.765 which is obtained by a competitive system using DNN and bag-of-words features.

源语言英语
主期刊名Proceedings - 9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017
出版商Institute of Electrical and Electronics Engineers Inc.
1602-1607
页数6
ISBN(电子版)9781538615423
DOI
出版状态已出版 - 2 7月 2017
活动9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017 - Kuala Lumpur, 马来西亚
期限: 12 12月 201715 12月 2017

出版系列

姓名Proceedings - 9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017
2018-February

会议

会议9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017
国家/地区马来西亚
Kuala Lumpur
时期12/12/1715/12/17

指纹

探究 'Topic embedding of sentences for story segmentation' 的科研主题。它们共同构成独一无二的指纹。

引用此