A new method for mining of WWW access sequences.

Authors :
Oyanagi, Shigeru
Kamiharako, Masatoshi
Kubota, Kazuto
Nakase, Akihiko
Source :
Electronics & Communications in Japan, Part 2: Electronics. Oct2007, Vol. 90 Issue 10, p127-138. 12p. 11 Diagrams, 4 Charts.
Publication Year :
2007

Abstract

Analysis of access sequences is an important technique in the mining of WWW access logs. The well-known apriori algorithm is a typical method. A problem of this method is that the obtained relation between sequences is not reflected in the output. This paper proposes a new method of sequence analysis using matrix clustering. This method considers a binary matrix in which the sequences correspond to the rows and ordered pairs of pages correspond to the columns. The similarities between sequences are extracted as clusters in the matrix. Based on these clusters, super-sequences, which are generalizations of similar sequences, can be generated. The proposed method is applied to real data and the results are evaluated. It is verified that the features of entire sequences can be extracted. © 2007 Wiley Periodicals, Inc. Electron Comm Jpn Pt 2, 90(10): 127–138, 2007; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/ecjb.20394 [ABSTRACT FROM AUTHOR]
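
The binary matrix described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `ordered_pairs` and `build_binary_matrix` are hypothetical, and the sketch assumes that an "ordered pair of pages" means any two distinct pages where the first is visited somewhere before the second in the same sequence (the abstract does not say whether pairs must be adjacent). The clustering step itself is not shown, since the abstract does not specify it.

```python
from itertools import combinations


def ordered_pairs(sequence):
    """Return the set of ordered page pairs (a, b) with a visited before b.

    Assumption: pairs need not be adjacent, and a page paired with
    itself (a revisit) is ignored.
    """
    pairs = set()
    for i, j in combinations(range(len(sequence)), 2):
        if sequence[i] != sequence[j]:
            pairs.add((sequence[i], sequence[j]))
    return pairs


def build_binary_matrix(sequences):
    """Build a binary matrix: one row per sequence, one column per ordered pair.

    Returns (columns, matrix), where columns is the sorted list of all
    ordered pairs seen in any sequence, and matrix[r][c] is 1 if
    sequence r contains the pair columns[c], else 0.
    """
    pair_sets = [ordered_pairs(s) for s in sequences]
    columns = sorted(set().union(*pair_sets))
    matrix = [[1 if col in ps else 0 for col in columns] for ps in pair_sets]
    return columns, matrix


# Toy access sequences (invented for illustration, not from the paper).
sequences = [["A", "B", "C"], ["A", "C"], ["B", "A"]]
columns, matrix = build_binary_matrix(sequences)
for row in matrix:
    print(row)
```

In this toy input, the first two rows share the column for the pair (A, C), which is the kind of overlap a matrix clustering step could group together; the third row shares no column with the others.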

Details

Language :
English
ISSN :
8756-663X
Volume :
90
Issue :
10
Database :
Academic Search Index
Journal :
Electronics & Communications in Japan, Part 2: Electronics
Publication Type :
Academic Journal
Accession number :
26847945
Full Text :
https://doi.org/10.1002/ecjb.20394