Back to Search Start Over

Distributed randomized algorithms for low-support data mining

Authors :
Alfredo Pulvirenti
Rosalba Giugno
Misael Mongiovì
Alfredo Ferro
Source :
IPDPS
Publication Year :
2009
Publisher :
IEEE, 2009.

Abstract

Data mining in distributed systems has been facilitated by using high-support association rules. Less attention has been paid to distributed low-support/high-correlation data mining. This has proved useful in several fields such as computational biology, wireless networks, web mining, security and rare events analysis in industrial plants. In this paper we present distributed versions of efficient algorithms for low-support/high-correlation data mining such as Min-Hashing, K-Min-Hashing and Locality-Sensitive-Hashing. Experimental results on real data concerning scalability, speed-up and network traffic are reported.

Details

Database :
OpenAIRE
Journal :
2009 IEEE International Symposium on Parallel & Distributed Processing
Accession number :
edsair.doi.dedup.....68e7cc22849ac5d9e56d3ea6640ffb45
Full Text :
https://doi.org/10.1109/ipdps.2009.5161156