Descriptor: "NEAREST-NEIGHBOR CLASSIFICATION" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"NEAREST-NEIGHBOR CLASSIFICATION"' showing total 23 results

Start Over Descriptor "NEAREST-NEIGHBOR CLASSIFICATION"

23 results on '"NEAREST-NEIGHBOR CLASSIFICATION"'

1. Improving the Accuracy of Nearest-Neighbor Classification Using Principled Construction and Stochastic Sampling of Training-Set Centroids.

Author: Whitelam, Stephen
Subjects: image recognition, nearest-neighbor classification, stochastic sampling, cs.LG, cond-mat.stat-mech, stat.ML, Fluids & Plasmas, Mathematical Sciences, Physical Sciences
Abstract: A conceptually simple way to classify images is to directly compare test-set data and training-set data. The accuracy of this approach is limited by the method of comparison used, and by the extent to which the training-set data cover configuration space. Here we show that this coverage can be substantially increased using coarse-graining (replacing groups of images by their centroids) and stochastic sampling (using distinct sets of centroids in combination). We use the MNIST and Fashion-MNIST data sets to show that a principled coarse-graining algorithm can convert training images into fewer image centroids without loss of accuracy of classification of test-set images by nearest-neighbor classification. Distinct batches of centroids can be used in combination as a means of stochastically sampling configuration space, and can classify test-set data more accurately than can the unaltered training set. On the MNIST and Fashion-MNIST data sets this approach converts nearest-neighbor classification from a mid-ranking- to an upper-ranking member of the set of classical machine-learning techniques.
Published: 2021

2. Building Footprint Extraction from Very-High-Resolution Satellite Image Using Object-Based Image Analysis (OBIA) Technique

Author: Prathiba, A. P., Rastogi, Kriti, Jain, Gaurav V., Govind Kumar, V. V., di Prisco, Marco, Series Editor, Chen, Sheng-Hong, Series Editor, Vayas, Ioannis, Series Editor, Kumar Shukla, Sanjay, Series Editor, Sharma, Anuj, Series Editor, Kumar, Nagesh, Series Editor, Wang, Chien Ming, Series Editor, Ghosh, Jayanta Kumar, editor, and da Silva, Irineu, editor
Published: 2020
Full Text: View/download PDF

3. Identification of Remote Sensing-Based Land Cover Types Combining Nearest-Neighbor Classification and SEaTH Algorithm.

Author: Zhao, Jinling, Fang, Yan, Zhang, Mingmei, and Dong, Yingying
Abstract: The development of spaceborne remote sensing has greatly facilitated the land cover mapping at various spatial scales. Classification accuracy, however, is usually affected by the heterogeneous spectra of different land cover types for medium–low-spatial-resolution images. The study is aimed at improving the classification accuracy at a city scale by proposing a hierarchical classification method. Time-series Landsat-5 and Landsat-8 Operational Land Imager remote sensing images of 4 years were used as the classified images. A total of six first-class land cover types were determined, namely woodland, grassland, cropland, wetland, artificial surface and others. The object-based image analysis was chosen over pixel-based approaches. More specifically, the nearest-neighbor (NN) classification and SEparability and THresholds (SEaTH) algorithm were combined to produce a hierarchical classification method (NN-SEaTH). SEaTH algorithm was first used to extract the wetland after performing image segmentation in eCognition Developer. Then, the non-wetland was further classified to vegetation and non-vegetation by using a normalized difference vegetation index image. Finally, the other types were then obtained using the NN classification. To validate the proposed method, the NN classifier and NN-SEaTH method were compared. The proposed technique is shown to increase the overall accuracy (OA) and kappa coefficient (k) for the 4 years. The OA and k are, respectively, 96.46% and 0.9231, 96.63% and 0.9269, 96.88% and 0.9394, 95.22% and 0.9239 that are much larger than 88.13% and 0.7503, 88.83% and 0.7660, 88.64% and 0.7630, 87.33% and 0.7371 derived from the NN approach. The study provides a reference for medium-resolution-based land cover mapping by a hierarchical classification. [ABSTRACT FROM AUTHOR]
Published: 2020
Full Text: View/download PDF

4. Improving the Accuracy of Nearest-Neighbor Classification Using Principled Construction and Stochastic Sampling of Training-Set Centroids

Author: Stephen Whitelam
Subjects: image recognition, nearest-neighbor classification, stochastic sampling, Science, Astrophysics, QB460-466, Physics, QC1-999
Abstract: A conceptually simple way to classify images is to directly compare test-set data and training-set data. The accuracy of this approach is limited by the method of comparison used, and by the extent to which the training-set data cover configuration space. Here we show that this coverage can be substantially increased using coarse-graining (replacing groups of images by their centroids) and stochastic sampling (using distinct sets of centroids in combination). We use the MNIST and Fashion-MNIST data sets to show that a principled coarse-graining algorithm can convert training images into fewer image centroids without loss of accuracy of classification of test-set images by nearest-neighbor classification. Distinct batches of centroids can be used in combination as a means of stochastically sampling configuration space, and can classify test-set data more accurately than can the unaltered training set. On the MNIST and Fashion-MNIST data sets this approach converts nearest-neighbor classification from a mid-ranking- to an upper-ranking member of the set of classical machine-learning techniques.
Published: 2021
Full Text: View/download PDF

5. Weakly Aligned Multi-part Bag-of-Poses for Action Recognition from Depth Cameras

Author: Seidenari, Lorenzo, Varano, Vincenzo, Berretti, Stefano, Del Bimbo, Alberto, Pala, Pietro, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Petrosino, Alfredo, editor, Maddalena, Lucia, editor, and Pala, Pietro, editor
Published: 2013
Full Text: View/download PDF

6. Large width nearest prototype classification on general distance spaces.

Author: Anthony, Martin and Ratsaby, Joel
Subjects: *PROTOTYPE equipment, *PROTOTYPES, *TRIANGLE inequality, *FUNCTIONAL analysis, *BINARY codes
Abstract: In this paper we consider the problem of learning nearest-prototype classifiers in any finite distance space; that is, in any finite set equipped with a distance function. An important advantage of a distance space over a metric space is that the triangle inequality need not be satisfied, which makes our results potentially very useful in practice. We consider a family of binary classifiers for learning nearest-prototype classification on distance spaces, building on the concept of large-width learning which we introduced and studied in earlier works. Nearest-prototype is a more general version of the ubiquitous nearest-neighbor classifier: a prototype may or may not be a sample point. One advantage in the approach taken in this paper is that the error bounds depend on a ‘width’ parameter, which can be sample-dependent and thereby yield a tighter bound. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

7. The Choice of Reference Points in Best-Match File Searching.

Author: Shapiro, Marvin
Subjects: *ALGORITHMS, *COMPUTER simulation, *ELECTROMECHANICAL analogies, *MATHEMATICAL models, *SIMULATION methods & models, *ARITHMETIC, *MODELS & modelmaking, *ENGINEERING models, *MECHANICS (Physics)
Abstract: Improvements to the exhaustive search method of best-match file searching have previously been achieved by doing a preprocessing step involving the calculation of distances from a reference point. This paper discusses the proper choice of reference points and extends the previous algorithm to use more than one reference point. It is shown that reference points should be located outside of data clusters. The results of computer simulations are presented which show that large improvements can be achieved by the proper choice and location of multiple reference points. [ABSTRACT FROM AUTHOR]
Published: 1977
Full Text: View/download PDF

8. OPM2L: An optimal instance partition-based multi-metric learning method for heterogeneous dataset classification.

Author: Deng, Huiyuan, Meng, Xiangzhu, Wang, Huibing, and Feng, Lin
Subjects: *KEYWORD searching, *RIEMANNIAN manifolds, *MEASURING instruments, *CLASSIFICATION
Abstract: Multi-metric learning -a method to learn multiple local metrics to reveal the feature's correlations of samples from different local regions-has become an essential tool to measure the similarities between instances from heterogeneous datasets. However, most existing cluster-based MML methods first partition the training data with a predefined metric and then learn multiple metrics via the local instances, leading to these two independent procedures fail to cooperate with each other. In this paper, we propose an Optimal instance Partition-based Multi-Metric Learning (OPM2L) method for heterogeneous dataset classification by unifying the instance partition and multiple local metrics learning into a single objective. In particular, multiple anchor centers together with a global metric are employed to assist the instance partition process. During the training, the shared information contained in local metrics is aggregated into the global metric by a dedicated regularizer, which improves the instance partition process and offers the subsequent multiple local metrics learning with more informative instances. Moreover, an efficient alternating direction technology is employed to seek a feasible solution to the proposed method. We further confirmed that the sub-problems can be settled with closed-form solutions, while the superiority of the proposed method is also proved by experimental results on extensive datasets. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

9. An efficient method for clustered multi-metric learning.

Author: Nguyen, Bac, Ferri, Francesc J., Morell, Carlos, and De Baets, Bernard
Subjects: *KERNEL (Mathematics), *KERNEL functions, *SUPPORT vector machines, *GEOMETRIC function theory, *COMPLEX variables
Abstract: Abstract Distance metric learning, which aims at finding a distance metric that separates examples of one class from examples of the other classes, is the key to the success of many machine learning tasks. Although there has been an increasing interest in this field, learning a global distance metric is insufficient to obtain satisfactory results when dealing with heterogeneously distributed data. A simple solution to tackle this kind of data is based on kernel embedding methods. However, it quickly becomes computationally intractable as the number of examples increases. In this paper, we propose an efficient method that learns multiple local distance metrics instead of a single global one. More specifically, the training examples are divided into several disjoint clusters, in each of which a distance metric is trained to separate the data locally. Additionally, a global regularization is introduced to preserve some common properties of different clusters in the learned metric space. By learning multiple distance metrics jointly within a single unified optimization framework, our method consistently outperforms single distance metric learning methods, while being more efficient than other state-of-the-art multi-metric learning methods. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF

10. Improved Search of Relevant Points for Nearest-Neighbor Classification

Author: Flores-Velazco, Alejandro
Subjects: Computational Geometry (cs.CG), FOS: Computer and information sciences, border points, Computer Science - Machine Learning, decision boundaries, nearest-neighbor rule, Theory of computation → Computational geometry, relevant points, Computer Science - Computational Geometry, nearest-neighbor classification, Machine Learning (cs.LG)
Abstract: Given a training set P ⊂ ℝ^d, the nearest-neighbor classifier assigns any query point q ∈ ℝ^d to the class of its closest point in P. To answer these classification queries, some training points are more relevant than others. We say a training point is relevant if its omission from the training set could induce the misclassification of some query point in ℝ^d. These relevant points are commonly known as border points, as they define the boundaries of the Voronoi diagram of P that separate points of different classes. Being able to compute this set of points efficiently is crucial to reduce the size of the training set without affecting the accuracy of the nearest-neighbor classifier. Improving over a decades-long result by Clarkson (FOCS'94), Eppstein (SOSA’22) recently proposed an output-sensitive algorithm to find the set of border points of P in 𝒪(n² + nk²) time, where k is the size of such set. In this paper, we improve this algorithm to have time complexity equal to 𝒪(nk²) by proving that the first phase of their algorithm, which requires 𝒪(n²) time, are unnecessary., LIPIcs, Vol. 244, 30th Annual European Symposium on Algorithms (ESA 2022), pages 54:1-54:10
Published: 2022

11. Predicting Human Protein Subcellular Locations by Using a Combination of Network and Function Features

Author: Lei Chen, ZhanDong Li, Tao Zeng, Yu-Hang Zhang, ShiQi Zhang, Tao Huang, and Yu-Dong Cai
Subjects: KEGG enrichment, Computer science, Functional features, Feature selection, Computational biology, protein subcellular location, QH426-470, COMPLEX-I, feature selection, Protein Annotation, CYTOPLASMIC FILAMENTS, Interaction network, Genetics, AMINO-ACID-COMPOSITION, CELL, GO enrichment, NDUFS3 SUBUNIT, Genetics (clinical), Original Research, STRING DATABASE, LOCALIZATION, Subcellular localization, Statistical classification, Functional annotation, FEATURE-SELECTION, Molecular Medicine, NEAREST-NEIGHBOR CLASSIFICATION, protein-protein interaction network, classification algorithm, MALATE-DEHYDROGENASE, Function (biology)
Abstract: Given the limitation of technologies, the subcellular localizations of proteins are difficult to identify. Predicting the subcellular localization and the intercellular distribution patterns of proteins in accordance with their specific biological roles, including validated functions, relationships with other proteins, and even their specific sequence characteristics, is necessary. The computational prediction of protein subcellular localizations can be performed on the basis of the sequence and the functional characteristics. In this study, the protein–protein interaction network, functional annotation of proteins and a group of direct proteins with known subcellular localization were used to construct models. To build efficient models, several powerful machine learning algorithms, including two feature selection methods, four classification algorithms, were employed. Some key proteins and functional terms were discovered, which may provide important contributions for determining protein subcellular locations. Furthermore, some quantitative rules were established to identify the potential subcellular localizations of proteins. As the first prediction model that uses direct protein annotation information (i.e., functional features) and STRING-based protein–protein interaction network (i.e., network features), our computational model can help promote the development of predictive technologies on subcellular localizations and provide a new approach for exploring the protein subcellular localization patterns and their potential biological importance Given the limitation of technologies, the subcellular localizations of proteins are difficult to identify. Predicting the subcellular localization and the intercellular distribution patterns of proteins in accordance with their specific biological roles, including validated functions, relationships with other proteins, and even their specific sequence characteristics, is necessary. The computational prediction of protein subcellular localizations can be performed on the basis of the sequence and the functional characteristics. In this study, the protein-protein interaction network, functional annotation of proteins and a group of direct proteins with known subcellular localization were used to construct models. To build efficient models, several powerful machine learning algorithms, including two feature selection methods, four classification algorithms, were employed. Some key proteins and functional terms were discovered, which may provide important contributions for determining protein subcellular locations. Furthermore, some quantitative rules were established to identify the potential subcellular localizations of proteins. As the first prediction model that uses direct protein annotation information (i.e., functional features) and STRING-based protein-protein interaction network (i.e., network features), our computational model can help promote the development of predictive technologies on subcellular localizations and provide a new approach for exploring the protein subcellular localization patterns and their potential biological importance.
Published: 2021
Full Text: View/download PDF

12. Predicting Human Protein Subcellular Locations by Using a Combination of Network and Function Features

Author: Chen, Lei, Li, ZhanDong, Zeng, Tao, Zhang, Yu-Hang, Zhang, ShiQi, Huang, Tao, Cai, Yu-Dong, Chen, Lei, Li, ZhanDong, Zeng, Tao, Zhang, Yu-Hang, Zhang, ShiQi, Huang, Tao, and Cai, Yu-Dong
Abstract: Given the limitation of technologies, the subcellular localizations of proteins are difficult to identify. Predicting the subcellular localization and the intercellular distribution patterns of proteins in accordance with their specific biological roles, including validated functions, relationships with other proteins, and even their specific sequence characteristics, is necessary. The computational prediction of protein subcellular localizations can be performed on the basis of the sequence and the functional characteristics. In this study, the protein–protein interaction network, functional annotation of proteins and a group of direct proteins with known subcellular localization were used to construct models. To build efficient models, several powerful machine learning algorithms, including two feature selection methods, four classification algorithms, were employed. Some key proteins and functional terms were discovered, which may provide important contributions for determining protein subcellular locations. Furthermore, some quantitative rules were established to identify the potential subcellular localizations of proteins. As the first prediction model that uses direct protein annotation information (i.e., functional features) and STRING-based protein–protein interaction network (i.e., network features), our computational model can help promote the development of predictive technologies on subcellular localizations and provide a new approach for exploring the protein subcellular localization patterns and their potential biological importance, Given the limitation of technologies, the subcellular localizations of proteins are difficult to identify. Predicting the subcellular localization and the intercellular distribution patterns of proteins in accordance with their specific biological roles, including validated functions, relationships with other proteins, and even their specific sequence characteristics, is necessary. The computational prediction of protein subcellular localizations can be performed on the basis of the sequence and the functional characteristics. In this study, the protein-protein interaction network, functional annotation of proteins and a group of direct proteins with known subcellular localization were used to construct models. To build efficient models, several powerful machine learning algorithms, including two feature selection methods, four classification algorithms, were employed. Some key proteins and functional terms were discovered, which may provide important contributions for determining protein subcellular locations. Furthermore, some quantitative rules were established to identify the potential subcellular localizations of proteins. As the first prediction model that uses direct protein annotation information (i.e., functional features) and STRING-based protein-protein interaction network (i.e., network features), our computational model can help promote the development of predictive technologies on subcellular localizations and provide a new approach for exploring the protein subcellular localization patterns and their potential biological importance.
Published: 2021

13. Online equivalence learning through a Quasi-Newton method.

Author: Le Capitaine, Hoel
Abstract: Recently, the community has shown a growing interest in building online learning models. In this paper, we are interested in the framework of fuzzy equivalences obtained by residual implications. Models are generally based on the relevance degree between pairs of objects of the learning set, and the update is obtained by using a standard stochastic (online) gradient descent. This paper proposes another method for learning fuzzy equivalences using a Quasi-Newton optimization. The two methods are extensively compared on real data sets for the task of nearest sample(s) classification. [ABSTRACT FROM PUBLISHER]
Published: 2012
Full Text: View/download PDF

14. Boundary-Sensitive Approach for Approximate Nearest-Neighbor Classification

Author: Flores-Velazco, Alejandro and Mount, David M.
Subjects: space-time tradeoffs, geometric data structures, approximate nearest-neighbor searching, Theory of computation → Computational geometry, nearest-neighbor classification
Abstract: The problem of nearest-neighbor classification is a fundamental technique in machine-learning. Given a training set P of n labeled points in ℝ^d, and an approximation parameter 0 < ε ≤ 1/2, any unlabeled query point should be classified with the class of any of its ε-approximate nearest-neighbors in P. Answering these queries efficiently has been the focus of extensive research, proposing techniques that are mainly tailored towards resolving the more general problem of ε-approximate nearest-neighbor search. While the latest can only hope to provide query time and space complexities dependent on n, the problem of nearest-neighbor classification accepts other parameters more suitable to its analysis. Such is the number k_ε of ε-border points, which describes the complexity of boundaries between sets of points of different classes. This paper presents a new data structure called Chromatic AVD. This is the first approach for ε-approximate nearest-neighbor classification whose space and query time complexities are only dependent on ε, k_ε and d, while being independent on both n and Δ, the spread of P., LIPIcs, Vol. 204, 29th Annual European Symposium on Algorithms (ESA 2021), pages 44:1-44:15
Published: 2021
Full Text: View/download PDF

15. Improved learning of I2C distance and accelerating the neighborhood search for image classification

Author: Wang, Zhengxiang, Hu, Yiqun, and Chia, Liang-Tien
Subjects: *MACHINE learning, *IMAGE processing, *NEAREST neighbor analysis (Statistics), *VARIANCES, *PERFORMANCE, *COST effectiveness, *CLASSIFICATION
Abstract: Abstract: Image-to-class (I2C) distance is a novel measure for image classification and has successfully handled datasets with large intra-class variances. However, due to the lack of a training phase, the performance of this distance is easily affected by irrelevant local features that may hurt the classification accuracy. Besides, the success of this I2C distance relies heavily on the large number of local features in the training set, which requires expensive computation cost for classifying test images. On the other hand, if there are small number of local features in the training set, it may result in poor performance. In this paper, we propose a distance learning method to improve the classification accuracy of this I2C distance as well as two strategies for accelerating its NN search. We first propose a large margin optimization framework to learn the I2C distance function, which is modeled as a weighted combination of the distance from every local feature in an image to its nearest-neighbor (NN) in a candidate class. We learn these weights associated with local features in the training set by constraining the optimization such that the I2C distance from image to its belonging class should be less than that to any other class. We evaluate the proposed method on several publicly available image datasets and show that the performance of I2C distance for classification can significantly be improved by learning a weighted I2C distance function. To improve the computation cost, we also propose two methods based on spatial division and hubness score to accelerate the NN search, which is able to largely reduce the on-line testing time while still preserving or even achieving a better classification accuracy. [Copyright &y& Elsevier]
Published: 2011
Full Text: View/download PDF

16. Unifying Instance-Based and Rule-Based Induction.

Author: Domingos, Pedro
Abstract: Several well-developed approaches to inductive learning now exist, but each has specific limitations that are hard to overcome. Multi-strategy learning attempts to tackle this problem by combining multiple methods in one algorithm. This article describes a unification of two widely-used empirical approaches: rule induction and instance-based learning. In the new algorithm, instances are treated as maximally specific rules, and classification is performed using a best-match strategy. Rules are learned by gradually generalizing instances until no improvement in apparent accuracy is obtained. Theoretical analysis shows this approach to be efficient. It is implemented in the RISE 3.1 system. In an extensive empirical study, RISE consistently achieves higher accuracies than state-of-the-art representatives of both its parent approaches (PEBLS and CN2), as well as a decision tree learner (C4.5). Lesion studies show that each of RISE's components is essential to this performance. Most significantly, in 14 of the 30 domains studied, RISE is more accurate than the best of PEBLS and CN2, showing that a significant synergy can be obtained by combining multiple empirical methods. [ABSTRACT FROM AUTHOR]
Published: 1996
Full Text: View/download PDF

17. Prototype-based models in machine learning

Subjects: Computer Science::Machine Learning, ORGANIZING FEATURE MAPS, NEURAL-GAS NETWORK, STRUCTURED DATA, ALGORITHMS, NEAREST-NEIGHBOR CLASSIFICATION, VECTOR QUANTIZATION, LVQ, PRESERVATION, DATA VISUALIZATION, SOM
Abstract: An overview is given of prototype-based models in machine learning. In this framework, observations, i.e., data, are stored in terms of typical representatives. Together with a suitable measure of similarity, the systems can be employed in the context of unsupervised and supervised analysis of potentially high-dimensional, complex datasets. We discuss basic schemes of competitive vector quantization as well as the so-called neural gas approach and Kohonen's topology-preserving self-organizing map. Supervised learning in prototype systems is exemplified in terms of learning vector quantization. Most frequently, the familiar Euclidean distance serves as a dissimilarity measure. We present extensions of the framework to nonstandard measures and give an introduction to the use of adaptive distances in relevance learning. (C) 2016 Wiley Periodicals, Inc.
Published: 2016
Full Text: View/download PDF

18. Guarantees on nearest-neighbor condensation heuristics.

Author: Flores-Velazco, Alejandro and Mount, David
Subjects: *HEURISTIC, *SURETYSHIP & guaranty, *CONDENSATION
Abstract: The problem of nearest-neighbor condensation aims to reduce the size of a training set of a nearest-neighbor classifier while maintaining its classification accuracy. Although many condensation techniques have been proposed, few bounds have been proved on the amount of reduction achieved. In this paper, we present one of the first theoretical results for practical nearest-neighbor condensation algorithms. We propose two condensation algorithms, called RSS and VSS, along with provable upper-bounds on the size of their selected subsets. Additionally, we shed light on the selection size of two well known condensation algorithms, called MSS and FCNN, and compare them to the new algorithms. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

19. Prototype-based models in machine learning

Author: Biehl, Michael, Hammer, Barbara, Villmann, Thomas, and Intelligent Systems
Subjects: Computer Science::Machine Learning, ORGANIZING FEATURE MAPS, Neurons, STRUCTURED DATA, ALGORITHMS, Statistics as Topic, VECTOR QUANTIZATION, SOM, Pattern Recognition, Automated, Machine Learning, NEURAL-GAS NETWORK, Data Mining, Computer Simulation, NEAREST-NEIGHBOR CLASSIFICATION, LVQ, PRESERVATION, Neural Networks, Computer, DATA VISUALIZATION
Abstract: An overview is given of prototype-based models in machine learning. In this framework, observations, i.e., data, are stored in terms of typical representatives. Together with a suitable measure of similarity, the systems can be employed in the context of unsupervised and supervised analysis of potentially high-dimensional, complex datasets. We discuss basic schemes of competitive vector quantization as well as the so-called neural gas approach and Kohonen's topology-preserving self-organizing map. Supervised learning in prototype systems is exemplified in terms of learning vector quantization. Most frequently, the familiar Euclidean distance serves as a dissimilarity measure. We present extensions of the framework to nonstandard measures and give an introduction to the use of adaptive distances in relevance learning. (C) 2016 Wiley Periodicals, Inc.
Published: 2015

20. Cluster-based adaptive metric classification

Author: Nicolai Petkov, Ioannis Giotis, and Intelligent Systems
Subjects: Cognitive Neuroscience, Adaptive metric, Principal component analysis, Bayes' theorem, NUMBER, Artificial Intelligence, One-class classification, Prototype-based classification, MAXIMUM-LIKELIHOOD, Mathematics, Mahalanobis distance, business.industry, Pattern recognition, Class (biology), Gap statistic, Computer Science Applications, Statistical classification, ComputingMethodologies_PATTERNRECOGNITION, Classification rule, Bayes' rule, Metric (mathematics), Cluster estimation, Artificial intelligence, NEAREST-NEIGHBOR CLASSIFICATION, business
Abstract: Introducing adaptive metric has been shown to improve the results of distance-based classification algorithms. Existing methods are often computationally intensive, either in the training or in the classification phase. We present a novel algorithm that we call Cluster-Based Adaptive Metric (CLAM) classification. It first determines the number of clusters in each class of a training set and then computes the parameters of a Mahalanobis distance for each cluster. The derived Mahalanobis distances are then used to estimate the probability of cluster- and, subsequently, class-membership. We compare the proposed algorithm with other classification algorithms using 10 different data sets. The proposed CLAM algorithm is as effective as other adaptive metric classification algorithms yet it is simpler to use and in many cases computationally more efficient. (C) 2011 Elsevier B.V. All rights reserved.
Published: 2012

21. Optimized Nearest-Neighbor Classifiers Using Generated Instances

Author: Fuchs, Matthias and Abecker, Andreas
Subjects: Instance-based Learning, Genetic Algorithm, ddc:004, Nearest-Neighbor Classification
Abstract: We present a novel approach to classification, based on a tight coupling of instancebased learning and a genetic algorithm. In contrast to the usual instance-based learning setting, we do not rely on (parts of) the given training set as the basis of a nearestneighbor classifier, but we try to employ artificially generated instances as concept prototypes. The extremely hard problem of finding an appropriate set of concept prototypes is tackled by a genetic search procedure with the classification accuracy on the given training set as evaluation criterion for the genetic fitness measure. Experiments with artificial datasets show that - due to the ability to find concise and accurate concept descriptions that contain few, but typical instances - this classification approach is considerably robust against noise, untypical training instances and irrelevant attributes. These favorable (theoretical) properties are corroborated using a number of hard real-world classification problems.
Published: 1996

22. Contributions to metric learning for nearest neighbor classification.

Author: Phaibulpanich, Akarin
Subjects: Contributions, Dimension Reduction, Metric Learning, Nearest-neighbor Classification
Abstract: In this thesis, we develop methods for constructing an A-weighted metric (x - y)' A( x - y) that improves the performance of K-nearest neighbor (KNN) classifiers. KNN is known to be highly flexible, but can be somewhat inefficient and unstable. By incorporating a parametrically optimized metric into KNN, global dimension reduction is carried out efficiently, leaving the most difficult nonlinear features of the problem to be solved on a low dimensional projected feature space. Optimization over A is done by formulating a probability model that captures KNN's essential property---using only a local neighborhood of training cases to predict the class of a test case. The expected correct vote margin can be calculated under the probability model and optimized over A using gradient methods to yield a metric that is adapted to a particular problem. This framework incorporates variable selection as well as variate selection, in which certain linear combinations of the variables are deemed either informative or completely uninformative. The estimated A matrix can be used for both classification and data analysis, as it contains information about which features are informative (either in a linear or nonlinear sense), or completely uninformative about class membership. In this thesis, an approach for optimizing the KNN metric is derived, algorithms are developed, properties of the method are explored, and the performance of the method is evaluated using both simulated and real data.
Published: 2006

23. Optimized Nearest-Neighbor Classifiers Using Generated Instances

Author: Fuchs, Matthias, Abecker, Andreas, Fuchs, Matthias, and Abecker, Andreas
Abstract: We present a novel approach to classification, based on a tight coupling of instancebased learning and a genetic algorithm. In contrast to the usual instance-based learning setting, we do not rely on (parts of) the given training set as the basis of a nearestneighbor classifier, but we try to employ artificially generated instances as concept prototypes. The extremely hard problem of finding an appropriate set of concept prototypes is tackled by a genetic search procedure with the classification accuracy on the given training set as evaluation criterion for the genetic fitness measure. Experiments with artificial datasets show that - due to the ability to find concise and accurate concept descriptions that contain few, but typical instances - this classification approach is considerably robust against noise, untypical training instances and irrelevant attributes. These favorable (theoretical) properties are corroborated using a number of hard real-world classification problems.

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

23 results on '"NEAREST-NEIGHBOR CLASSIFICATION"'

1. Improving the Accuracy of Nearest-Neighbor Classification Using Principled Construction and Stochastic Sampling of Training-Set Centroids.

2. Building Footprint Extraction from Very-High-Resolution Satellite Image Using Object-Based Image Analysis (OBIA) Technique

3. Identification of Remote Sensing-Based Land Cover Types Combining Nearest-Neighbor Classification and SEaTH Algorithm.

4. Improving the Accuracy of Nearest-Neighbor Classification Using Principled Construction and Stochastic Sampling of Training-Set Centroids

5. Weakly Aligned Multi-part Bag-of-Poses for Action Recognition from Depth Cameras

6. Large width nearest prototype classification on general distance spaces.

7. The Choice of Reference Points in Best-Match File Searching.

8. OPM2L: An optimal instance partition-based multi-metric learning method for heterogeneous dataset classification.

9. An efficient method for clustered multi-metric learning.

10. Improved Search of Relevant Points for Nearest-Neighbor Classification

11. Predicting Human Protein Subcellular Locations by Using a Combination of Network and Function Features

12. Predicting Human Protein Subcellular Locations by Using a Combination of Network and Function Features

13. Online equivalence learning through a Quasi-Newton method.

14. Boundary-Sensitive Approach for Approximate Nearest-Neighbor Classification

15. Improved learning of I2C distance and accelerating the neighborhood search for image classification

16. Unifying Instance-Based and Rule-Based Induction.

17. Prototype-based models in machine learning

18. Guarantees on nearest-neighbor condensation heuristics.

19. Prototype-based models in machine learning

20. Cluster-based adaptive metric classification

21. Optimized Nearest-Neighbor Classifiers Using Generated Instances

22. Contributions to metric learning for nearest neighbor classification.

23. Optimized Nearest-Neighbor Classifiers Using Generated Instances

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

23 results on '"NEAREST-NEIGHBOR CLASSIFICATION"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources