1. A Strategy on Selecting Performance Metrics for Classifier Evaluation
- Author
-
Yangguang Liu, Shiting Wen, Yangming Zhou, and Chaogang Tang
- Subjects
Receiver operating characteristic ,Mean squared error ,Computer Networks and Communications ,business.industry ,Computer science ,Mean absolute error ,Machine learning ,computer.software_genre ,Spearman's rank correlation coefficient ,Correlation ,Cohen's kappa ,Artificial intelligence ,Data mining ,Linear correlation ,business ,computer ,Classifier (UML) - Abstract
The evaluation of classifiers' performances plays a critical role in construction and selection of classification model. Although many performance metrics have been proposed in machine learning community, no general guidelines are available among practitioners regarding which metric to be selected for evaluating a classifier's performance. In this paper, we attempt to provide practitioners with a strategy on selecting performance metrics for classifier evaluation. Firstly, the authors investigate seven widely used performance metrics, namely classification accuracy, F-measure, kappa statistic, root mean square error, mean absolute error, the area under the receiver operating curve, and the area under the precision-recall curve. Secondly, the authors resort to using Pearson linear correlation and Spearman rank correlation to analyses the potential relationship among these seven metrics. Experimental results show that these commonly used metrics can be divided into three groups, and all metrics within a given group are highly correlated but less correlated with metrics from different groups.
- Published
- 2014
- Full Text
- View/download PDF