Back to Search Start Over

Statistical computation of feature weighting schemes through data estimation for nearest neighbor classifiers.

Authors :
Sáez, José A.
Derrac, Joaquín
Luengo, Julián
Herrera, Francisco
Source :
Pattern Recognition. Dec2014, Vol. 47 Issue 12, p3941-3948. 8p.
Publication Year :
2014

Abstract

The Nearest Neighbor rule is one of the most successful classifiers in machine learning. However, it is very sensitive to noisy, redundant and irrelevant features, which may cause its performance to deteriorate. Feature weighting methods try to overcome this problem by incorporating weights into the similarity function to increase or reduce the importance of each feature, according to how they behave in the classification task. This paper proposes a new feature weighting classifier, in which the computation of the weights is based on a novel idea combining imputation methods – used to estimate a new distribution of values for each feature based on the rest of the data – and the Kolmogorov–Smirnov nonparametric statistical test to measure the changes between the original and imputed distribution of values. This proposal is compared with classic and recent feature weighting methods. The experimental results show that our feature weighting scheme is very resilient to the choice of imputation method and is an effective way of improving the performance of the Nearest Neighbor classifier, outperforming the rest of the classifiers considered in the comparisons. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00313203
Volume :
47
Issue :
12
Database :
Academic Search Index
Journal :
Pattern Recognition
Publication Type :
Academic Journal
Accession number :
98144668
Full Text :
https://doi.org/10.1016/j.patcog.2014.06.012