Large-scale adaptive semi-supervised learning via unified inductive and transductive model

De Wang, Feiping Nie, Heng Huang

科研成果: 书/报告/会议事项章节会议稿件同行评审

44 引用 (Scopus)

摘要

Most semi-supervised learning models propagate the labels over the Laplacian graph, where the graph should be built beforehand. However, the computational cost of constructing the Laplacian graph matrix is very high. On the other hand, when we do classification, data points lying around the decision boundary (boundary points) are noisy for learning the correct classifier and deteriorate the classification performance. To address these two challenges, in this paper, we propose an adaptive semi-supervised learning model. Different from previous semi-supervised learning approaches, our new model needn't construct the graph Laplacian matrix. Thus, our method avoids the huge computational cost required by previous methods, and achieves a computational complexity linear to the number of data points. Therefore, our method is scalable to large-scale data. Moreover, the proposed model adaptively suppresses the weights of boundary points, such that our new model is robust to the boundary points. An efficient algorithm is derived to alternatively optimize the model parameter and class probability distribution of the unlabeled data, such that the induction of classifier and the transduction of labels are adaptively unified into one framework. Extensive experimental results on six real-world data sets show that the proposed semi-supervised learning model outperforms other related methods in most cases.

源语言英语
主期刊名KDD 2014 - Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
出版商Association for Computing Machinery
482-491
页数10
ISBN(印刷版)9781450329569
DOI
出版状态已出版 - 2014
已对外发布
活动20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2014 - New York, NY, 美国
期限: 24 8月 201427 8月 2014

出版系列

姓名Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

会议

会议20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2014
国家/地区美国
New York, NY
时期24/08/1427/08/14

指纹

探究 'Large-scale adaptive semi-supervised learning via unified inductive and transductive model' 的科研主题。它们共同构成独一无二的指纹。

引用此