Back to Search Start Over

High Performance Subgraph Mining in Molecular Compounds

Authors :
Michael R. Berthold
Giuseppe Di Fatta
Source :
High Performance Computing and Communications ISBN: 9783540290315, HPCC
Publication Year :
2005
Publisher :
Springer Berlin Heidelberg, 2005.

Abstract

Structured data represented in the form of graphs arises in\ud several fields of the science and the growing amount of available data makes distributed graph mining techniques particularly relevant. In this paper, we present a distributed approach to the frequent subgraph mining\ud problem to discover interesting patterns in molecular compounds. The problem is characterized by a highly irregular search tree, whereby no reliable workload prediction is available. We describe the three main\ud aspects of the proposed distributed algorithm, namely a dynamic partitioning of the search space, a distribution process based on a peer-to-peer communication framework, and a novel receiver-initiated, load balancing\ud algorithm. The effectiveness of the distributed method has been evaluated on the well-known National Cancer Institute’s HIV-screening dataset, where the approach attains close-to linear speedup in a network\ud of workstations.

Details

ISBN :
978-3-540-29031-5
ISSN :
03029743
ISBNs :
9783540290315
Database :
OpenAIRE
Journal :
High Performance Computing and Communications ISBN: 9783540290315, HPCC
Accession number :
edsair.doi.dedup.....5a72ddc23104c4660a9a64896856465b
Full Text :
https://doi.org/10.1007/11557654_97