1. Deep-LIFT: Deep Label-Specific Feature Learning for Image Annotation
- Author
-
Changqing Zhang, Huazhu Fu, Qinghua Hu, Joey Tianyi Zhou, Junbing Li, and Shuyin Xia
- Subjects
Lift (data mining) ,business.industry ,Computer science ,Pattern recognition ,Computer Science Applications ,Image (mathematics) ,Human-Computer Interaction ,ComputingMethodologies_PATTERNRECOGNITION ,Automatic image annotation ,Control and Systems Engineering ,Feature (computer vision) ,Benchmark (computing) ,Graph (abstract data type) ,Artificial intelligence ,Electrical and Electronic Engineering ,business ,Feature learning ,Algorithms ,Data Curation ,Software ,Information Systems ,Interpretability - Abstract
Image annotation aims to jointly predict multiple tags for an image. Although significant progress has been achieved, existing approaches usually overlook aligning specific labels and their corresponding regions due to the weak supervised information (i.e., ``bag of labels'' for regions), thus failing to explicitly exploit the discrimination from different classes. In this article, we propose the deep label-specific feature (Deep-LIFT) learning model to build the explicit and exact correspondence between the label and the local visual region, which improves the effectiveness of feature learning and enhances the interpretability of the model itself. Deep-LIFT extracts features for each label by aligning each label and its region. Specifically, Deep-LIFTs are achieved through learning multiple correlation maps between image convolutional features and label embeddings. Moreover, we construct two variant graph convolutional networks (GCNs) to further capture the interdependency among labels. Empirical studies on benchmark datasets validate that the proposed model achieves superior performance on multilabel classification over other existing state-of-the-art methods.
- Published
- 2022