Author: "Mingyue Luo" / Publisher: ieee - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Mingyue Luo"' showing total 4 results

Start Over Author "Mingyue Luo" Publisher ieee

4 results on '"Mingyue Luo"'

1. Distributed log information processing with Map-Reduce: A case study from raw data to final models

Author: Gang Liu and Mingyue Luo
Subjects: Distributed database, Computer science, business.industry, Machine learning, computer.software_genre, Data modeling, Software, Data extraction, Scalability, Data pre-processing, Artificial intelligence, Data mining, Cluster analysis, Raw data, business, computer
Abstract: With the high development of Internet, e-commerce websites now routinely have to work with log datasets which are up to a few terabytes in size. How to remove messy data timely with low cost and find out useful information is a problem we have to face. The mining process involves several steps from pre-processing the raw data to establishing the final models. In this paper we describe our method to solve the problem with Map-Reduce. Hadoop[7] is a Map-Reduce implementation develops open-source software for reliable, scalable, distributed computing. Several applications which we have proposed: data extracting, sum operation, join operation and clustering algorithm are applied on hadoop. We can apply them on data pre-processing and detect users with the same interests. In particular, we focus on clustering algorithms. A clustering algorithms which integrate SOM(Self-Organized Map) and fuzzy[13] logic is combined with Map-Reduce and we call it MRSF here. With the help of hadoop cluster, large calculation of jobs with MRSF can be accommodated easily by just adding more nodes or computers to the cluster. From the experiment, we show that MRSF can scale well and efficiently process and analyze extremely large datasets.
Published: 2010

2. Clustering Algorithm on Block Division of Documents

Author: Mingyue Luo and Gang Liu
Subjects: Clustering high-dimensional data, Fuzzy clustering, k-medoids, Computer science, Correlation clustering, Single-linkage clustering, k-means clustering, Constrained clustering, computer.software_genre, Determining the number of clusters in a data set, Data stream clustering, CURE data clustering algorithm, Canopy clustering algorithm, Algorithm design, Data mining, Cluster analysis, computer, k-medians clustering, FSA-Red Algorithm
Abstract: In the traditional K-means algorithm, the selection of cluster number and the initial cluster center brings huge affection on the quality of clustering. To reduce the dependence on the initial center and to locate the types of new data rapidly, an algorithm applicable for text data is proposed. In this algorithm, document density is considered as parameter. Documents are divided into blocks first. After that every divided block is clustered separately. Experiment shows that this algorithm not only makes higher quality for clustering, but also does well in the new increasing data.
Published: 2010

3. Clustering Algorithm on Block Division of Documents.

Author: Gang Liu and Mingyue Luo
Published: 2010
Full Text: View/download PDF

4. Distributed log information processing with Map-Reduce: A case study from raw data to final models.

Author: Mingyue Luo and Gang Liu
Published: 2010
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

4 results on '"Mingyue Luo"'

1. Distributed log information processing with Map-Reduce: A case study from raw data to final models

2. Clustering Algorithm on Block Division of Documents

3. Clustering Algorithm on Block Division of Documents.

4. Distributed log information processing with Map-Reduce: A case study from raw data to final models.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

4 results on '"Mingyue Luo"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources