Back to Search Start Over

Feature selection method for software defect number prediction based on maximum information coefficient.

Authors :
LIU Guoqing
WANG Xingqi
WEI Dan
FANG Jinglong
SHAO Yanli
Source :
Telecommunications Science; May2021, Vol. 37 Issue 5, p133-147, 15p
Publication Year :
2021

Abstract

The traditional feature selection method only considers the linear correlation between variables and ignores the nonlinear correlation, so it is difficult to select effective feature subsets to build the effective model to predict the number of faults in software modules. Considering the linear and nonlinear relationship, a feature selection method based on maximum information coefficient (MIC) was proposed. The proposed method separated the redundancy analysis and correlation analysis into two phases. In the previous phase, the cluster algorithm, which was based on the correlation between features, was used to divide the redundant features into the same cluster. In the later phase, the features in each cluster were sorted in descending order according to the correlation between features and the number of software defects, and then the top features were selected to form the feature subset. The experimental results show that the proposed method can improve the prediction performance of software defect number prediction model by effectively removing redundant and irrelevant features. [ABSTRACT FROM AUTHOR]

Details

Language :
Chinese
ISSN :
10000801
Volume :
37
Issue :
5
Database :
Complementary Index
Journal :
Telecommunications Science
Publication Type :
Academic Journal
Accession number :
151031500