Back to Search Start Over

Finding representative patterns withordered projections

Authors :
Miguel Toro
Jesús S. Aguilar-Ruiz
José C. Riquelme
Universidad de Sevilla. Departamento de Lenguajes y Sistemas Informáticos
Comisión Interministerial de Ciencia y Tecnología (CICYT). España
Source :
idUS. Depósito de Investigación de la Universidad de Sevilla, instname
Publication Year :
2003
Publisher :
Elsevier, 2003.

Abstract

This paper presents a new approach to 2nding representative patterns for dataset editing. The algorithm patterns by ordered projections (POP), has some interesting characteristics: important reduction of the number of instances from the dataset; lower computational cost (� (mn log n)) with respect to other typical algorithms due to the absence of distance calculations; conservation of the decision boundaries, especially from the point of view of the application of axis-parallel classi2ers. POP works well in practice withbothcontinuous and discrete attributes. The performance of POP is analysed in two ways: percentage of reduction and classi2cation. POP has been compared to IB2, ENN and SHRINK concerning the percentage of reduction and the computational cost. In addition, we have analysed the accuracy of k-NN and C4.5 after applying the reduction techniques. An extensive empirical study using datasets with continuous and discrete attributes from the UCI repository shows that POP is a valuable preprocessing method for the later application of any axis-parallel learning algorithm. ? 2002 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.

Details

Database :
OpenAIRE
Journal :
idUS. Depósito de Investigación de la Universidad de Sevilla, instname
Accession number :
edsair.doi.dedup.....a397b094f610eba7e356bcadbc5acc08