Back to Search Start Over

Hardware-Enhanced Association Rule Mining with Hashing and Pipelining.

Authors :
Ying-Hsiang Wen
Jen-Wei Huang
Ming-Syan Chen
Source :
IEEE Transactions on Knowledge & Data Engineering. Jun2008, Vol. 20 Issue 6, p784-795. 12p. 3 Black and White Photographs, 11 Diagrams, 1 Chart, 5 Graphs.
Publication Year :
2008

Abstract

Generally speaking, to implement Apriori-based association rule mining in hardware, one has to load candidate iten~sets and a database into the hardware. Since the capacity of the hardware architecture is fixed, if the number of candidate itemsets dr the number of items in the database is larger than the hardware capacity, the items are loaded into the hardware separately. The time complexity of those steps that need to load candidate itemsets or database items into the hardware is in proportion to the number of candidate itemsets multiplied by the number of items in the database. Too many candidate itemsets and a large database would create a performance bottleneck. In this paper, we propose a HAsh-based and PiPellned (abbreviated as HAPPI) architecture for hardware- enhanced association rule mining. We apply the pipeline methodology in the HAPPI architecture to compare itemsets with the database and collect useful information for reducing the number of candidate itemsets and items in the database simultaneou~ly. When the database is fed into the hardware, candidate itemsets are ~ompared with the items in the database to find frequent itemsets. At the same time, trimming information is collected from each transaction. In addition, itemsets are generated from transactions and hashed into a hash table. The useful trimming information and the hash table enable us to reduce the number of items in the database and the number of candidate itemsets. Therefore, we can effectively reduce the frequency of loading the database into the hardware. As such, HAPPI solves the bottleneck problem in a priori-based hardware schemes. We also derive some properties to investigate the performance of this hardware implementation. As shown by the experiment results, HAPPI significantly outperforms the previous hardware approach and the software algorithm in terms of execution time. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
10414347
Volume :
20
Issue :
6
Database :
Academic Search Index
Journal :
IEEE Transactions on Knowledge & Data Engineering
Publication Type :
Academic Journal
Accession number :
32439726
Full Text :
https://doi.org/10.1109/TKDE.2008.39