Back to Search Start Over

MADPL-net: Multi-layer attention dictionary pair learning network for image classification.

Authors :
Sun, Yulin
Shi, Guangming
Dong, Weisheng
Xie, Xuemei
Source :
Journal of Visual Communication & Image Representation. Feb2023, Vol. 90, pN.PAG-N.PAG. 1p.
Publication Year :
2023

Abstract

With the great success of deep neural networks, combining deep learning with traditional dictionary learning has become a hot issue. However, the performance of these methods is still limited for several reasons. First, some existing methods update dictionary learning and classifier as two independent modules, which limits the classification performance. Second, the non-attention dictionary is learned to represent all images, reducing the model representation flexibility. In this paper, we design a novel end-to-end model named Multi-layer Attention Dictionary Pair Learning Network (MADPL-net), which integrates the learning schemes of the convolutional neural network, deep encoder learning, and attention dictionary pair learning (ADicL) into a unified framework. The encoder layer contains the ADicL block, which selects more image-attentive atoms in the dictionary pair block via the softmax function to ensure MADPL-net classification capability. In addition, ADicL schema can yield discriminative dictionary atoms and feature maps with high inter-class separation and high intra-class compactness. To improve the sparse representation learning performance, MADPL-net adds l 1 − norm constraint of the analysis dictionary to the cross-entropy loss function. Extensive experiments show that MADPL-net can achieve excellent performance over other state-of-the-arts. • A powerful architecture, called the multi-layer attention dictionary pair learning network (MADPL-net), is proposed for image classification. • The MADPL-net integrates convolutional neural network, deep encoder learning, and attention dictionary pair learning into a unified framework. • The MADPL-net applies the attentional dictionary learning block to select more image-attentive atoms via the soft-max function. • The MADPL-net adds the l 1 − norm constraint of analysis dictionary to the crossentropy loss function. • The MADPL-net can ensure discriminative dictionary atoms and feature maps with high inter-class separation and high intra-class compactness. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
10473203
Volume :
90
Database :
Academic Search Index
Journal :
Journal of Visual Communication & Image Representation
Publication Type :
Academic Journal
Accession number :
161362816
Full Text :
https://doi.org/10.1016/j.jvcir.2022.103728