SQL based frequent pattern mining with FP-growth

Xuequn Shang, Kai Uwe Sattler, Ingolf Geist

科研成果: 书/报告/会议事项章节会议稿件同行评审

12 引用 (Scopus)

摘要

Scalable data mining in large databases is one of today's real challenges to database research area. The integration of data mining with database systems is an essential component for any successful large-scale data mining application. A fundamental component in data mining tasks is finding frequent patterns in a given dataset. Most of the previous studies adopt an Apriori-like candidate set generation-and-test approach. However, candidate set generation is still costly, especially when there exist prolific patterns and/or long patterns. In this study we present an evaluation of SQL based frequent pattern mining with a novel frequent pattern growth (FP-growth) method, which is efficient and scalable for mining both long and short patterns without candidate generation. We examine some techniques to improve performance. In addition, we have made performance evaluation on DBMS with IBM DB2 UDB EEE V8.

源语言英语
主期刊名Applic. of Declarative Program. and Knowledge Manage. - 15th Int. Conf. on Applications of Declarative Program. and Knowledge Manage., INAP 2004, and 18th Workshop on Logic Program., WLP 2004
出版商Springer Verlag
32-46
页数15
ISBN(印刷版)3540255605, 9783540255604
DOI
出版状态已出版 - 2005
已对外发布
活动15th International Conference on Applications of Declarative Programming and Knowledge Management, INAP 2004, and 18th Workshop on Logic Programming, WLP 2004 - Applications of Declarative Programming and Knowledge Management - Potsdam, 德国
期限: 4 3月 20046 3月 2004

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
3392 LNAI
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议15th International Conference on Applications of Declarative Programming and Knowledge Management, INAP 2004, and 18th Workshop on Logic Programming, WLP 2004 - Applications of Declarative Programming and Knowledge Management
国家/地区德国
Potsdam
时期4/03/046/03/04

指纹

探究 'SQL based frequent pattern mining with FP-growth' 的科研主题。它们共同构成独一无二的指纹。

引用此