Back to Search
Start Over
Two-Stage Feature Selection with Unsupervised Second Stage.
- Source :
- International Journal on Artificial Intelligence Tools; Nov2018, Vol. 27 Issue 7, pN.PAG-N.PAG, 18p
- Publication Year :
- 2018
-
Abstract
- Feature selection is a common dimensionality reduction technique of fundamental importance in big data. A common approach for reducing the running time of feature selection is to perform it in two stages. In the first stage a fast and simple filter is applied to select good candidates. The number of candidates is further reduced in the second stage by an accurate algorithm that may run significantly slower. There are two main variants of feature selection: unsupervised and supervised. In the supervised variant features are selected for predicting labels, while the unsupervised variant does not use labels at all. We describe a general framework that can use an arbitrary off-the-shelf unsupervised algorithm for the second stage. The algorithm is applied to the selection obtained in the first stage weighted appropriately. Our main technical result is a method for calculating weights for the columns that need to be selected in the second stage. We show that these weights can be computed as the solution to a constrained quadratic optimization problem. The solution is deterministic, and improves on previously published studies that use probabilistic ideas to compute similar weights. To the best of our knowledge our approach is the first technique for converting a supervised feature selection problem into an unsupervised problem. Complexity analysis shows that the proposed technique is very fast, can be implemented in a single pass over the data, and can take advantage of data sparsity. Experimental results show that the accuracy of the proposed method is comparable to that of much slower techniques. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 02182130
- Volume :
- 27
- Issue :
- 7
- Database :
- Complementary Index
- Journal :
- International Journal on Artificial Intelligence Tools
- Publication Type :
- Academic Journal
- Accession number :
- 132993135
- Full Text :
- https://doi.org/10.1142/S021821301860014X