Back to Search Start Over

Mining Interpretable AOG Representations From Convolutional Networks via Active Question Answering.

Authors :
Zhang, Quanshi
Ren, Jie
Huang, Ge
Cao, Ruiming
Wu, Ying Nian
Zhu, Song-Chun
Source :
IEEE Transactions on Pattern Analysis & Machine Intelligence. Nov2021, Vol. 43 Issue 11, p3949-3963. 15p.
Publication Year :
2021

Abstract

In this paper, we present a method to mine object-part patterns from conv-layers of a pre-trained convolutional neural network (CNN). The mined object-part patterns are organized by an And-Or graph (AOG). This interpretable AOG representation consists of a four-layer semantic hierarchy, i.e., semantic parts, part templates, latent patterns, and neural units. The AOG associates each object part with certain neural units in feature maps of conv-layers. The AOG is constructed with very few annotations (e.g., 3–20) of object parts. We develop a question-answering (QA) method that uses active human-computer communications to mine patterns from a pre-trained CNN, in order to explain features in conv-layers incrementally. During the learning process, our QA method uses the current AOG for part localization. The QA method actively identifies objects, whose feature maps cannot be explained by the AOG. Then, our method asks people to annotate parts on the unexplained objects, and uses answers to discover CNN patterns corresponding to newly labeled parts. In this way, our method gradually grows new branches and refines existing branches on the AOG to semanticize CNN representations. In experiments, our method exhibited a high learning efficiency. Our method used about $1/6$ 1 / 6 – $1/3$ 1 / 3 of the part annotations for training, but achieved similar or better part-localization performance than fast-RCNN methods. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
01628828
Volume :
43
Issue :
11
Database :
Academic Search Index
Journal :
IEEE Transactions on Pattern Analysis & Machine Intelligence
Publication Type :
Academic Journal
Accession number :
153710041
Full Text :
https://doi.org/10.1109/TPAMI.2020.2993147