Probabilistic labeled semi-supervised SVM

Mingjie Qian, Feiping Nie, Changshui Zhang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Scopus citations

Abstract

Semi-supervised learning has been paid increasing attention and is widely used in many fields such as data mining, information retrieval and knowledge management as it can utilize both labeled and unlabeled data. Laplacian SVM (LapSVM) is a very classical method whose effectiveness has been validated by large number of experiments. However, LapSVM is sensitive to labeled data and it exposes to cubic computation complexity which limit its application in large scale scenario. In this paper, we propose a multi-class method called Probabilistic labeled Semi-supervised SVM (PLSVM) in which the optimal decision surface is taught by probabilistic labels of all the training data including the labeled and unlabeled data. Then we propose a kernel version dual coordinate descent method to efficiently solve the dual problems of our Probabilistic labeled Semi-supervised SVM and decrease its requirement of memory. Synthetic data and several benchmark real world datasets show that PLSVM is less sensitive to labeling and has better performance over traditional methods like SVM, LapSVM (LapSVM) and Transductive SVM (TSVM).

Original languageEnglish
Title of host publicationICDM Workshops 2009 - IEEE International Conference on Data Mining
Pages394-399
Number of pages6
DOIs
StatePublished - 2009
Externally publishedYes
Event2009 IEEE International Conference on Data Mining Workshops, ICDMW 2009 - Miami, FL, United States
Duration: 6 Dec 20096 Dec 2009

Publication series

NameICDM Workshops 2009 - IEEE International Conference on Data Mining

Conference

Conference2009 IEEE International Conference on Data Mining Workshops, ICDMW 2009
Country/TerritoryUnited States
CityMiami, FL
Period6/12/096/12/09

Keywords

  • Dual coordinate descent algorithm
  • Multi-class classification
  • Probabilistic label
  • Semi-supervised learning

Fingerprint

Dive into the research topics of 'Probabilistic labeled semi-supervised SVM'. Together they form a unique fingerprint.

Cite this