A new method for mining of WWW access sequences.
- Source :
- Electronics & Communications in Japan, Part 2: Electronics. Oct2007, Vol. 90 Issue 10, p127-138. 12p. 11 Diagrams, 4 Charts.
- Publication Year :
- 2007
Abstract
- Analysis of access sequences is an important technique in the mining of WWW access logs. The well-known apriori algorithm is a typical method. A problem of this method is that the obtained relation between sequences is not reflected in the output. This paper proposes a new method of sequence analysis using matrix clustering. This method considers a binary matrix in which the sequences correspond to the rows and ordered pairs of pages correspond to the columns. The similarities between sequences are extracted as clusters in the matrix. Based on these clusters, super-sequences, which are generalizations of similar sequences, can be generated. The proposed method is applied to real data and the results are evaluated. It is verified that the features of entire sequences can be extracted. © 2007 Wiley Periodicals, Inc. Electron Comm Jpn Pt 2, 90(10): 127–138, 2007; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/ecjb.20394 [ABSTRACT FROM AUTHOR]
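
The binary matrix described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes that an "ordered pair of pages" means two pages visited consecutively in a sequence, which the abstract does not specify, and the function name `build_pair_matrix` and the sample sequences are hypothetical. Only the matrix construction is shown; the clustering and super-sequence generation steps are not reproduced here.

```python
from typing import List, Tuple

Pair = Tuple[str, str]


def build_pair_matrix(sequences: List[List[str]]) -> Tuple[List[Pair], List[List[int]]]:
    """Return (columns, matrix); matrix[i][j] is 1 if sequence i contains columns[j]."""
    # Assumption: an ordered pair is two pages visited one right after the other.
    pair_sets = [set(zip(seq, seq[1:])) for seq in sequences]
    # Columns are all distinct ordered pairs seen in any sequence, sorted for stability.
    columns = sorted(set().union(*pair_sets))
    # One row per sequence, one 0/1 entry per ordered pair.
    matrix = [[1 if pair in pairs else 0 for pair in columns] for pairs in pair_sets]
    return columns, matrix


# Hypothetical access sequences (page identifiers are made up).
sequences = [
    ["A", "B", "C"],
    ["A", "B", "D"],
    ["X", "Y"],
]
columns, matrix = build_pair_matrix(sequences)
```

In this toy input the first two rows share the column for the pair (A, B), which is the kind of overlap a clustering step over the matrix could pick up as similarity between sequences.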
Details
- Language :
- English
- ISSN :
- 8756-663X
- Volume :
- 90
- Issue :
- 10
- Database :
- Academic Search Index
- Journal :
- Electronics & Communications in Japan, Part 2: Electronics
- Publication Type :
- Academic Journal
- Accession number :
- 26847945
- Full Text :
- https://doi.org/10.1002/ecjb.20394