An improved Latent Dirichlet Allocation method for service topic detection

Lantian Guo, Zhe Li, Tao Yang, Huixiang Zhang, Dejun Mu, Yang Li

科研成果: 书/报告/会议事项章节会议稿件同行评审

2 引用 (Scopus)

摘要

Service topic detection is one of the most important techniques in service information extraction, clustering and recommendation. Comparing with short text corpus in social network, service description corpus possesses higher dimensionality and more diversity. It is difficult to detect topics from a large number of service descriptions. To address these challenges, we proposed a new LDA (Latent Dirichlet Allocation) model based topic detection method, referred to as CV-LDA (Context sensitive word Vector based LDA). It utilizes a word embedding based method that generate context sensitive vector to cluster the words for decreasing dimensionality. Through topic perplexity analysis in the real-world dataset, it is obvious that topics detected by our method has a lower perplexity, comparing with word frequency weighing based vectors.

源语言英语
主期刊名Proceedings of the 35th Chinese Control Conference, CCC 2016
编辑Jie Chen, Qianchuan Zhao, Jie Chen
出版商IEEE Computer Society
7045-7049
页数5
ISBN(电子版)9789881563910
DOI
出版状态已出版 - 26 8月 2016
活动35th Chinese Control Conference, CCC 2016 - Chengdu, 中国
期限: 27 7月 201629 7月 2016

出版系列

姓名Chinese Control Conference, CCC
2016-August
ISSN(印刷版)1934-1768
ISSN(电子版)2161-2927

会议

会议35th Chinese Control Conference, CCC 2016
国家/地区中国
Chengdu
时期27/07/1629/07/16

指纹

探究 'An improved Latent Dirichlet Allocation method for service topic detection' 的科研主题。它们共同构成独一无二的指纹。

引用此