Understanding and explaining convolutional neural networks based on inverse approach.

Authors :: Kwon, Hyuk Jin
Koo, Hyung Il
Cho, Nam Ik
Source :: Cognitive Systems Research. Jan2023, Vol. 77, p142-152. 11p.
Publication Year :: 2023
Abstract: Interpretability and explainability of machine learning systems have received ever-increasing attention, especially for deep neural networks. In the case of convolutional neural networks (CNNs), their properties are usually explained by generating local explanation maps (e.g., visualizing the contribution of individual pixels to a given prediction). In this paper, we propose a new framework that analyzes the inner workings of CNNs in terms of neural activations. To be precise, we consider a forward-pass as sequential activations of neurons and develop its inverse process, so that the inverse preserves the physical meaning of neuron activations. Our inverse process is formulated as a constrained optimization problem, and we solve the problem with the gradient projection algorithm. The proposed approach can provide equivalent visualization results to several conventional methods, and thus can be a reference tool for CNN visualization. Also, the attributions generated by our inverse method yield the state-of-the-art deletion scores and visualize the contribution of colors as well as shape features. [ABSTRACT FROM AUTHOR]

Subjects :: *CONVOLUTIONAL neural networks
*ARTIFICIAL neural networks
*MACHINE learning
*CONSTRAINED optimization

Full Text Access

Tools