Back to Search Start Over

Feature Selection Using Firefly Algorithm With Tree-Based Classification In Software Defect Prediction

Authors :
Vina Maulida
Rudy Herteno
Dwi Kartini
Friska Abadi
Mohammad Reza Faisal
Source :
Journal of Electronics, Electromedical Engineering, and Medical Informatics, Vol 5, Iss 4, Pp 223-230 (2023)
Publication Year :
2023
Publisher :
Poltekkes Kemenkes Surabaya, 2023.

Abstract

Defects that occur in software products are a universal occurrence. Software defect prediction is usually carried out to determine the performance, accuracy, precision and performance of the prediction model or method used in research, using various kinds of datasets. Software defect prediction is one of the Software Engineering studies that is of great concern to researchers. This research was conducted to determine the performance of tree-based classification algorithms including Decision Trees, Random Forests and Deep Forests without using feature selection and using firefly feature selection. And also know the tree-based classification algorithm with firefly feature selection which can provide better software defect prediction performance. The dataset used in this study is the ReLink dataset which consists of Apache, Safe and Zxing. Then the data is divided into testing data and training data with 10-fold cross validation. Then feature selection is performed using the Firefly Algorithm. Each ReLink dataset will be processed by each tree-based classification algorithm, namely Decision Tree, Random Forest and Deep Forest according to the results of the firefly feature selection. Performance evaluation uses the AUC value (Area under the ROC Curve). Research was conducted using google collab and the average AUC value generated by Firefly-Decision Tree is 0.66, the average AUC value generated by Firefly-Random Forest is 0.77, and the average AUC value generated by Firefly-Deep Forest is 0, 76. The results of this study indicate that the approach using the Firefly algorithm with Random Forest classification can work better in predicting software damage compared to other tree-based algorithms. In previous studies, tree-based classification with hyperparameter tuning on software defect prediction datasets obtained quite good results. In another study, the classification performance of SVM, Naïve Bayes and K-nearest neighbor with firefly feature selection resulted in improved performance. Therefore, this research was conducted to determine the performance of a tree-based algorithm using the firefly selection feature.

Details

Language :
English
ISSN :
26568632
Volume :
5
Issue :
4
Database :
Directory of Open Access Journals
Journal :
Journal of Electronics, Electromedical Engineering, and Medical Informatics
Publication Type :
Academic Journal
Accession number :
edsdoj.f2535a63ea3644649eeabc3ab835bbb3
Document Type :
article
Full Text :
https://doi.org/10.35882/jeeemi.v5i4.315