Back to Search Start Over

Fast k most similar neighbor classifier for mixed data (tree k-MSN)

Authors :
Hernández-Rodríguez, Selene
Martínez-Trinidad, J. Fco
Carrasco-Ochoa, J. Ariel
Source :
Pattern Recognition. Mar2010, Vol. 43 Issue 3, p873-886. 14p.
Publication Year :
2010

Abstract

Abstract: The k nearest neighbor (k-NN) classifier has been a widely used nonparametric technique in Pattern Recognition, because of its simplicity and good performance. In order to decide the class of a new prototype, the k-NN classifier performs an exhaustive comparison between the prototype to classify and the prototypes in the training set T. However, when T is large, the exhaustive comparison is expensive. For this reason, many fast k-NN classifiers have been developed, some of them are based on a tree structure, which is created during a preprocessing phase using the prototypes in T. Then, in a search phase, the tree is traversed to find the nearest neighbor. The speed up is obtained, while the exploration of some parts of the tree is avoided using pruning rules which are usually based on the triangle inequality. However, in soft sciences as Medicine, Geology, Sociology, etc., the prototypes are usually described by numerical and categorical attributes (mixed data), and sometimes the comparison function for computing the similarity between prototypes does not satisfy metric properties. Therefore, in this work an approximate fast k most similar neighbor classifier, for mixed data and similarity functions that do not satisfy metric properties, based on a tree structure (Tree k-MSN) is proposed. Some experiments with synthetic and real data are presented. [Copyright &y& Elsevier]

Details

Language :
English
ISSN :
00313203
Volume :
43
Issue :
3
Database :
Academic Search Index
Journal :
Pattern Recognition
Publication Type :
Academic Journal
Accession number :
45553507
Full Text :
https://doi.org/10.1016/j.patcog.2009.08.014