Back to Search Start Over

Machine learning interpretability for a stress scenario generation in credit scoring based on counterfactuals

Authors :
Andreas C. Bueff
Mateusz Cytryński
Raffaella Calabrese
Matthew Jones
John Roberts
Jonathon Moore
Iain Brown
Source :
Bueff, A C, Cytrynski, M, Calabrese, R, Jones, M, Roberts, J, Moore, J & Brown, I 2022, ' Machine learning interpretability for a stress scenario generation in credit scoring based on counterfactuals ', Expert Systems with Applications, vol. 202, 117271 . https://doi.org/10.1016/j.eswa.2022.117271
Publication Year :
2022
Publisher :
Elsevier BV, 2022.

Abstract

To boost the application of machine learning (ML) techniques for credit scoring models, the blackbox problem should be addressed. The primary aim of this paper is to propose a measure based on counterfactuals to evaluate the interpretability of a ML credit scoring technique. Counterfactuals assist with understanding the model with regard to the classification decision boundaries and evaluate model robustness. The second contribution is the development of a data perturbation technique to generate a stress scenario.We apply these two proposals to a dataset on UK unsecured personal loans to compare logistic regression and stochastic gradient boosting (SBG). We show that training a blackbox model (SGB) as conditioned on our data perturbation technique can provide insight into model performance under stressed scenarios. The empirical results show that our interpretability measure is able to capture the classification decision boundary, unlike AUC and the classification accuracy widely used in the banking sector.

Details

ISSN :
09574174
Volume :
202
Database :
OpenAIRE
Journal :
Expert Systems with Applications
Accession number :
edsair.doi.dedup.....98ff07cdef992331840e1418eec5a254