Back to Search Start Over

Fast k Most Similar Neighbor Classifier for Mixed Data Based on Approximating and Eliminating.

Authors :
Hernández-Rodríguez, Selene
Carrasco-Ochoa, J. Ariel
Martínez-Trinidad, J. Fco.
Source :
Advances in Knowledge Discovery & Data Mining: 12th Pacific-Asia Conference, Pakdd 2008 Osaka, Japan, May 20-23, 2008 Proceedings; 2008, p697-704, 8p
Publication Year :
2008

Abstract

The k nearest neighbor (k-NN) classifier has been a widely used nonparametric technique in Pattern Recognition. In order to decide the class of a new prototype, the k-NN classifier performs an exhaustive comparison between the prototype to classify (query) and the prototypes in the training set T. However, when T is large, the exhaustive comparison is expensive. To avoid this problem, many fast k-NN algorithms have been developed. Some of these algorithms are based on Approximating-Eliminating search. In this case, the Approximating and Eliminating steps rely on the triangle inequality. However, in soft sciences, the prototypes are usually described by qualitative and quantitative features (mixed data), and sometimes the comparison function does not satisfy the triangle inequality. Therefore, in this work, a fast k most similar neighbour classifier for mixed data (AEMD) is presented. This classifier consists of two phases. In the first phase, a binary similarity matrix among the prototypes in T is stored. In the second phase, new Approximating and Eliminating steps, which are not based on the triangle inequality, are presented. The proposed classifier is compared against other fast k-NN algorithms, which are adapted to work with mixed data. Some experiments with real datasets are presented. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISBNs :
9783540681243
Database :
Complementary Index
Journal :
Advances in Knowledge Discovery & Data Mining: 12th Pacific-Asia Conference, Pakdd 2008 Osaka, Japan, May 20-23, 2008 Proceedings
Publication Type :
Book
Accession number :
76805147
Full Text :
https://doi.org/10.1007/978-3-540-68125-0_66