Back to Search Start Over

Two-Stage Feature Selection with Unsupervised Second Stage.

Authors :
Xu, Ke
Maung, Crystal
Arai, Hiromasa
Schweitzer, Haim
Source :
International Journal on Artificial Intelligence Tools; Nov2018, Vol. 27 Issue 7, pN.PAG-N.PAG, 18p
Publication Year :
2018

Abstract

Feature selection is a common dimensionality reduction technique of fundamental importance in big data. A common approach for reducing the running time of feature selection is to perform it in two stages. In the first stage a fast and simple filter is applied to select good candidates. The number of candidates is further reduced in the second stage by an accurate algorithm that may run significantly slower. There are two main variants of feature selection: unsupervised and supervised. In the supervised variant features are selected for predicting labels, while the unsupervised variant does not use labels at all. We describe a general framework that can use an arbitrary off-the-shelf unsupervised algorithm for the second stage. The algorithm is applied to the selection obtained in the first stage weighted appropriately. Our main technical result is a method for calculating weights for the columns that need to be selected in the second stage. We show that these weights can be computed as the solution to a constrained quadratic optimization problem. The solution is deterministic, and improves on previously published studies that use probabilistic ideas to compute similar weights. To the best of our knowledge our approach is the first technique for converting a supervised feature selection problem into an unsupervised problem. Complexity analysis shows that the proposed technique is very fast, can be implemented in a single pass over the data, and can take advantage of data sparsity. Experimental results show that the accuracy of the proposed method is comparable to that of much slower techniques. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
02182130
Volume :
27
Issue :
7
Database :
Complementary Index
Journal :
International Journal on Artificial Intelligence Tools
Publication Type :
Academic Journal
Accession number :
132993135
Full Text :
https://doi.org/10.1142/S021821301860014X