Skip to main navigation Skip to search Skip to main content

An improved Latent Dirichlet Allocation method for service topic detection

  • Lantian Guo
  • , Zhe Li
  • , Tao Yang
  • , Huixiang Zhang
  • , Dejun Mu
  • , Yang Li
  • Northwestern Polytechnical University Xian
  • Xi'an Jiaotong University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

Service topic detection is one of the most important techniques in service information extraction, clustering and recommendation. Comparing with short text corpus in social network, service description corpus possesses higher dimensionality and more diversity. It is difficult to detect topics from a large number of service descriptions. To address these challenges, we proposed a new LDA (Latent Dirichlet Allocation) model based topic detection method, referred to as CV-LDA (Context sensitive word Vector based LDA). It utilizes a word embedding based method that generate context sensitive vector to cluster the words for decreasing dimensionality. Through topic perplexity analysis in the real-world dataset, it is obvious that topics detected by our method has a lower perplexity, comparing with word frequency weighing based vectors.

Original languageEnglish
Title of host publicationProceedings of the 35th Chinese Control Conference, CCC 2016
EditorsJie Chen, Qianchuan Zhao, Jie Chen
PublisherIEEE Computer Society
Pages7045-7049
Number of pages5
ISBN (Electronic)9789881563910
DOIs
StatePublished - 26 Aug 2016
Event35th Chinese Control Conference, CCC 2016 - Chengdu, China
Duration: 27 Jul 201629 Jul 2016

Publication series

NameChinese Control Conference, CCC
Volume2016-August
ISSN (Print)1934-1768
ISSN (Electronic)2161-2927

Conference

Conference35th Chinese Control Conference, CCC 2016
Country/TerritoryChina
CityChengdu
Period27/07/1629/07/16

Keywords

  • LDA Model
  • Perplexity
  • Service Topic
  • Word Embedding

Fingerprint

Dive into the research topics of 'An improved Latent Dirichlet Allocation method for service topic detection'. Together they form a unique fingerprint.

Cite this