Back to Search Start Over

Robust instance-dependent cost-sensitive classification.

Authors :
De Vos, Simon
Vanderschueren, Toon
Verdonck, Tim
Verbeke, Wouter
Source :
Advances in Data Analysis & Classification; Dec2023, Vol. 17 Issue 4, p1057-1079, 23p
Publication Year :
2023

Abstract

Instance-dependent cost-sensitive (IDCS) learning methods have proven useful for binary classification tasks where individual instances are associated with variable misclassification costs. However, we demonstrate in this paper by means of a series of experiments that IDCS methods are sensitive to noise and outliers in relation to instance-dependent misclassification costs and their performance strongly depends on the cost distribution of the data sample. Therefore, we propose a generic three-step framework to make IDCS methods more robust: (i) detect outliers automatically, (ii) correct outlying cost information in a data-driven way, and (iii) construct an IDCS learning method using the adjusted cost information. We apply this framework to cslogit, a logistic regression-based IDCS method, to obtain its robust version, which we name r-cslogit. The robustness of this approach is introduced in steps (i) and (ii), where we make use of robust estimators to detect and impute outlying costs of individual instances. The newly proposed r-cslogit method is tested on synthetic and semi-synthetic data and proven to be superior in terms of savings compared to its non-robust counterpart for variable levels of noise and outliers. All our code is made available online at https://github.com/SimonDeVos/Robust-IDCS. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
18625347
Volume :
17
Issue :
4
Database :
Complementary Index
Journal :
Advances in Data Analysis & Classification
Publication Type :
Periodical
Accession number :
173106005
Full Text :
https://doi.org/10.1007/s11634-022-00533-3