1. Comparison-based Inverse Classification for Interpretability in Machine Learning
- Author
-
Christophe Marsala, Thibault Laugel, Marcin Detyniecki, Xavier Renard, Marie-Jeanne Lesot, Learning, Fuzzy and Intelligent systems (LFI), LIP6, Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS), Groupe AXA (AXA), Jesús Medina, Manuel Ojeda-Aciego, José Luis Verdegay, David A. Pelta, Inma P. Cabrera, Bernadette Bouchon-Meunier, and Ronald R. Yager
- Subjects
Computer science ,business.industry ,05 social sciences ,Closeness ,050301 education ,Inverse ,02 engineering and technology ,Machine learning ,computer.software_genre ,comparison-based ,[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI] ,[INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG] ,Close neighbor ,0202 electrical engineering, electronic engineering, information engineering ,local explanation ,020201 artificial intelligence & image processing ,Artificial intelligence ,post-hoc interpretability ,business ,inverse classification ,0503 education ,computer ,Classifier (UML) ,Interpretability ,Test data - Abstract
International audience; In the context of post-hoc interpretability, this paper addresses the task of explaining the prediction of a classifier, considering the case where no information is available, neither on the classifier itself , nor on the processed data (neither the training nor the test data). It proposes an inverse classification approach whose principle consists in determining the minimal changes needed to alter a prediction: in an instance-based framework, given a data point whose classification must be explained, the proposed method consists in identifying a close neighbor classified differently, where the closeness definition integrates a spar-sity constraint. This principle is implemented using observation generation in the Growing Spheres algorithm. Experimental results on two datasets illustrate the relevance of the proposed approach that can be used to gain knowledge about the classifier.
- Published
- 2018