Depth-first frequent itemset mining in relational databases

Xuequn Shang, Kai Uwe Sattler

科研成果: 会议稿件论文同行评审

7 引用 (Scopus)

摘要

Data mining on large relational databases has gained popularity and its significance is well recognized. However, the performance of SQL based data mining is known to fall behind specialized implementation since the prohibitive nature of the cost associated with extracting knowledge, as well as the lack of suitable declarative query language support. We investigate approaches based on SQL for the problem of finding frequent patterns from a transaction table, including an algorithm that we recently proposed, called Propad (Pro-jection PAttern Discovery). Propad fundamentally differs from an Apriori-like candidate set generation-and-test approach. This approach successively projects the transaction table into frequent itemsets to avoid making multiple passes over the large original transaction table and generating a huge sets of candidates. We have made performance evaluation on DBMS (IBM DB2 UDB EEE V8) and compared the performance results with K-Way join approach proposed in [11] and SQL based FP-tree approach proposed in [13]. The experimental results show that our algorithm can get efficient performance.

源语言英语
1112-1117
页数6
DOI
出版状态已出版 - 2005
已对外发布
活动20th Annual ACM Symposium on Applied Computing - Santa Fe, NM, 美国
期限: 13 3月 200517 3月 2005

会议

会议20th Annual ACM Symposium on Applied Computing
国家/地区美国
Santa Fe, NM
时期13/03/0517/03/05

指纹

探究 'Depth-first frequent itemset mining in relational databases' 的科研主题。它们共同构成独一无二的指纹。

引用此