DOI

10.3906/elk-1602-113

Abstract

Association rule data mining is an important technique for finding important relationships in large datasets. Several frequent itemsets mining techniques have been proposed using a prefix-tree structure, FP-tree, a compressed data structure for database representation. The DIFFset data structure has also been shown to significantly reduce the run time and memory utilization of some data mining algorithms. Experimental results have demonstrated the efficiency of the two data structures in frequent itemsets mining. This work proposes FDM, a new algorithm based on FP-tree and DIFFset data structures for efficiently discovering frequent patterns in data. FDM can adapt its characteristics to efficiently mine long and short patterns from both dense and sparse datasets. Several optimization techniques are also outlined to increase the efficiency of FDM. An evaluation of FDM against three frequent itemset data mining algorithms, dEclat, FP-growth, and FDM* (FDM without optimization), was performed using datasets having both long and short frequent patterns. The experimental results show significant improvement in performance compared to the FP-growth, dEclat, and FDM* algorithms.

Keywords

Association rule data mining, FP-tree, Eclat, FP-growth, frequent itemsets

First Page

2096

Last Page

2107

Recommended Citation

GATUHA, GEORGE and JIANG, TAO (2017) "Smart frequent itemsets mining algorithm based on FP-tree and DIFFset data structures," Turkish Journal of Electrical Engineering and Computer Sciences: Vol. 25: No. 3, Article 39. https://doi.org/10.3906/elk-1602-113
Available at: https://journals.tubitak.gov.tr/elektrik/vol25/iss3/39

Download

Included in

Computer Engineering Commons, Computer Sciences Commons, Electrical and Computer Engineering Commons

COinS

Turkish Journal of Electrical Engineering and Computer Sciences

Smart frequent itemsets mining algorithm based on FP-tree and DIFFset data structures

DOI

Abstract

Keywords

First Page

Last Page

Recommended Citation

Included in

Issues by Year

Search

Turkish Journal of Electrical Engineering and Computer Sciences

Smart frequent itemsets mining algorithm based on FP-tree and DIFFset data structures

Authors

DOI

Abstract

Keywords

First Page

Last Page

Recommended Citation

Included in

Share

Issues by Year

Search