The Algorithm of Data Preprocessing in Web Log Mining Based on Cloud Computing.

Authors :
Zhang, Guanglu
Zhang, Mingxin
Source :
International Conference on Information Technology & Management Science (ICITMS 2012) Proceedings 2012; 2013, p467-474, 8p
Publication Year :
2013

Abstract

In a distributed cluster server architecture, a Web log mining model based on a data warehouse suffers from network and computing bottlenecks and from transmission errors caused by large data transfers. This paper draws on the advantages of cloud computing, distributed processing, and virtualization technology to design a Web log analysis platform based on the Hadoop cluster framework, and proposes a new hybrid algorithm for distributed processing in the cloud computing environment. To further verify the efficiency of the platform, we run the improved data preprocessing algorithm on it against a large number of Web logs; experimental results show that it can improve the efficiency of Web data mining. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISBNs :
9783642349096
Database :
Complementary Index
Journal :
International Conference on Information Technology & Management Science (ICITMS 2012) Proceedings 2012
Publication Type :
Book
Accession number :
118799095
Full Text :
https://doi.org/10.1007/978-3-642-34910-2_54