Back to Search Start Over

Algorithms for mining frequent itemsets in static and dynamic datasets.

Authors :
Hernández-León, R.
Hernández-Palancar, J.
Carrasco-Ochoa, Jesús A.
Martínez-Trinidad, José Fco.
Source :
Intelligent Data Analysis; 2010, Vol. 14 Issue 3, p419-435, 17p, 5 Diagrams, 2 Charts, 11 Graphs
Publication Year :
2010

Abstract

In this paper, two algorithms for mining frequent itemsets in large sparse datasets are proposed. The first one, named Compressed Arrays (CA), allows to process datasets that do not change along the time (static datasets) while the second one, based on the ideas of the former and named Dynamic Compressed Arrays (DCA), processes datasets that change along the time by adding/deleting transactions (dynamic datasets). Both algorithms introduce a novel way to use equivalence classes of itemsets by performing a breadth first search through them and by storing the class prefix support in compressed arrays, which allows fast itemset support computing. On the other hand, unlike previous algorithms for dynamic datasets that store the full dataset in main memory without reusing the current frequent itemsets, DCA algorithm stores the current frequent itemsets in binary files, grouped in equivalence classes, and reuses them to calculate the new frequent itemsets. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
1088467X
Volume :
14
Issue :
3
Database :
Complementary Index
Journal :
Intelligent Data Analysis
Publication Type :
Academic Journal
Accession number :
50633300
Full Text :
https://doi.org/10.3233/IDA-2010-0429