Back to Search
Start Over
A Survey of k Nearest Neighbor Algorithms for Solving the Class Imbalanced Problem.
- Source :
- Wireless Communications & Mobile Computing; 3/3/2021, p1-12, 12p
- Publication Year :
- 2021
-
Abstract
- k nearest neighbor (k NN) is a simple and widely used classifier; it can achieve comparable performance with more complex classifiers including decision tree and artificial neural network. Therefore, k NN has been listed as one of the top 10 algorithms in machine learning and data mining. On the other hand, in many classification problems, such as medical diagnosis and intrusion detection, the collected training sets are usually class imbalanced. In class imbalanced data, although positive examples are heavily outnumbered by negative ones, positive examples usually carry more meaningful information and are more important than negative examples. Similar to other classical classifiers, k NN is also proposed under the assumption that the training set has approximately balanced class distribution, leading to its unsatisfactory performance on imbalanced data. In addition, under a class imbalanced scenario, the global resampling strategies that are suitable to decision tree and artificial neural network often do not work well for k NN, which is a local information-oriented classifier. To solve this problem, researchers have conducted many works for k NN over the past decade. This paper presents a comprehensive survey of these works according to their different perspectives and analyzes and compares their characteristics. At last, several future directions are pointed out. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 15308669
- Database :
- Complementary Index
- Journal :
- Wireless Communications & Mobile Computing
- Publication Type :
- Academic Journal
- Accession number :
- 149334946
- Full Text :
- https://doi.org/10.1155/2021/5520990