Back to Search
Start Over
Out-of-core coherent closed quasi-clique mining from large dense graph databases
- Source :
- ACM Transactions on Database Systems. June, 2007, Vol. 32 Issue 2
- Publication Year :
- 2007
-
Abstract
- Due to the ability of graphs to represent more generic and more complicated relationships among different objects, graph mining has played a significant role in data mining, attracting increasing attention in the data mining community. In addition, frequent coherent subgraphs can provide valuable knowledge about the underlying internal structure of a graph database, and mining frequently occurring coherent subgraphs from large dense graph databases has witnessed several applications and received considerable attention in the graph mining community recently. In this article, we study how to efficiently mine the complete set of coherent closed quasi-cliques from large dense graph databases, which is an especially challenging task due to the fact that the downward-closure property no longer holds. By fully exploring some properties of quasi-cliques, we propose several novel optimization techniques which can prune the unpromising and redundant subsearch spaces effectively. Meanwhile, we devise an efficient closure checking scheme to facilitate the discovery of closed quasi-cliques only. Since large databasescannot be held in main memory, we also design an out-of-core solution with efficient index structures for mining coherent closed quasi-cliques from large dense graph databases. We call this Cocain *. Thorough performance study shows that Cocain * is very efficient and scalable for large dense graph databases. Categories and Subject Descriptors: H.2.8 [Database Management]: Database Applications--Data mining General Terms: Algorithms, Performance Additional Key Words and Phrases: Graph mining, quasi-clique, coherent subgraph, frequent closed subgraph, out-of-core algorithm ACM Reference Format: Zeng, Z., Wang, J., Zhou, L., and Karypis, G. 2007. Out-of-Core coherent closed quasi-clique mining from large dense graph databases. ACM Trans. Datab. Syst. 32, 2, Article 13 (June 2007), 40 pages. DOI = 10.1145/1242524.1242530 http://doi.acm.org/10.1145/1242524.1242530
Details
- Language :
- English
- ISSN :
- 03625915
- Volume :
- 32
- Issue :
- 2
- Database :
- Gale General OneFile
- Journal :
- ACM Transactions on Database Systems
- Publication Type :
- Academic Journal
- Accession number :
- edsgcl.166751206