Back to Search Start Over

Disclosed: An efficient depth-first, top-down algorithm for mining disjunctive closed itemsets in high-dimensional data.

Authors :
Vimieiro, Renato
Moscato, Pablo
Source :
Information Sciences. Oct2014, Vol. 280, p171-187. 17p.
Publication Year :
2014

Abstract

Abstract: We focus, in this paper, on the computational challenges of identifying disjunctive Boolean patterns in high-dimensional data. We conduct our analysis focusing particularly in microarray gene expression data, since this is one of the most stereotypical examples of high-dimensional data. We devised a novel algorithm that takes advantage of the scarcity of samples in microarray data sets, allowing us to efficiently find disjunctive closed patterns. Our algorithm, Disclosed, mines disjunctive closed itemsets by exploring the search space in a depth-first, top-down manner. We evaluated the performance of our algorithm to execute such a task using real microarray gene expression data sets publicly available on the Internet. Our experiments revealed under what situations, the characteristics of a data set, our method obtain a good, bad or average performance. We also compared the performance of our method with the state of the art algorithms for finding disjunctive closed patterns and disjunctive minimal generators. We observed that our approach is two orders of magnitude more efficient, both in terms of time and memory. [Copyright &y& Elsevier]

Details

Language :
English
ISSN :
00200255
Volume :
280
Database :
Academic Search Index
Journal :
Information Sciences
Publication Type :
Periodical
Accession number :
96668017
Full Text :
https://doi.org/10.1016/j.ins.2014.04.044