Exploring Intrinsic Discrimination and Consistency for Weakly Supervised Object Localization.

Authors :
Wang C
Xu R
Xu S
Meng W
Wang R
Zhang X
Source :
IEEE transactions on image processing : a publication of the IEEE Signal Processing Society [IEEE Trans Image Process] 2024; Vol. 33, pp. 1045-1058. Date of Electronic Publication: 2024 Jan 31.
Publication Year :
2024

Abstract

Weakly supervised object localization (WSOL) is a challenging and promising task that aims to localize objects using only image category labels as supervision. In the absence of annotated bounding boxes, WSOL methods must exploit the intrinsic properties of the image classification pipeline to generate object localizations. In this work, we propose a WSOL method that explores the Intrinsic Discrimination and Consistency in the image classification pipeline, which we call IDC. First, we develop a Triplet Metrics Based Foreground Modeling (TMFM) framework to directly predict object foreground regions using intrinsic discrimination. Unlike Class Activation Map (CAM) based methods, which also rely on intrinsic discrimination, our TMFM framework alleviates the problem of focusing only on the most discriminative parts by optimizing foreground and background regions synergistically. Second, we design a Dual Geometric Transformation Consistency Constraints (DGTC2) training strategy that introduces additional supervision and regularization constraints for WSOL by leveraging intrinsic geometric transformation consistency. The proposed pixel-wise and object-wise consistency constraint losses provide spontaneous supervision for WSOL at low cost. Extensive experiments show that our IDC method achieves significant and consistent performance gains over existing state-of-the-art WSOL approaches. Code is available at: https://github.com/vignywang/IDC.
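
The idea of geometric transformation consistency mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' DGTC2 loss, whose exact form is not given in this record. It only shows the general principle that the foreground mask predicted for a transformed image should match the transformed mask of the original image. The function names, the horizontal flip as the transformation, the mean squared error, and the two toy predictors are all assumptions made for illustration.

```python
import numpy as np


def hflip(x):
    """Horizontally flip an (H, W) array."""
    return x[:, ::-1]


def pixelwise_consistency_loss(predict_mask, image, transform=hflip):
    """Mean squared difference between the mask predicted for the
    transformed image and the transformed mask of the original image.
    (Hypothetical helper; the squared-error form is an assumption.)"""
    mask_of_transformed = predict_mask(transform(image))
    transformed_mask = transform(predict_mask(image))
    return float(np.mean((mask_of_transformed - transformed_mask) ** 2))


def equivariant_predictor(image):
    """Toy predictor: per-pixel sigmoid, so it commutes with flips."""
    return 1.0 / (1.0 + np.exp(-(image - 0.5)))


def biased_predictor(image):
    """Toy predictor with a left-to-right ramp, so it is not
    flip-consistent and incurs a positive loss."""
    _, w = image.shape
    ramp = np.linspace(0.0, 1.0, w)[None, :]
    return equivariant_predictor(image) * ramp


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    img = rng.random((8, 8))  # stand-in for a grayscale image
    print(pixelwise_consistency_loss(equivariant_predictor, img))
    print(pixelwise_consistency_loss(biased_predictor, img))
```

In a training setting a term of this kind would be added to the classification loss, so that a predictor whose masks shift under flips is penalized without any bounding-box labels.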

Details

Language :
English
ISSN :
1941-0042
Volume :
33
Database :
MEDLINE
Journal :
IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Publication Type :
Academic Journal
Accession number :
38271174
Full Text :
https://doi.org/10.1109/TIP.2024.3356174