Back to Search Start Over

Weakly supervised temporal action localization: a survey.

Authors :
Li, Ronglu
Zhang, Tianyi
Zhang, Rubo
Source :
Multimedia Tools & Applications; Sep2024, Vol. 83 Issue 32, p78361-78386, 26p
Publication Year :
2024

Abstract

Temporal action localization (TAL) is one of the most important tasks in video understanding. Weakly supervised temporal action localization (WTAL) involves classifying and localizing all the action instances in untrimmed videos under the supervision of only video-level category labels, which is a challenging task because of the absence of frame-level annotations. In this study, first, we review the development process of the WTAL task in recent years, summarize and analyze the main problems of WTAL. Second, we classify and compare the research approaches of existing models and thoroughly discuss methods based on multiple instance learning (MIL), feature erasing, the attention mechanism, similarity propagation, pseudo-ground truth generation, contrastive learning, and adversarial learning. Then, we present the datasets and evaluation criteria for the WTAL task. Finally, we discuss the main application areas and further developments in WTAL. [ABSTRACT FROM AUTHOR]

Subjects

Subjects :
DEEP learning
VIDEOS
ANNOTATIONS

Details

Language :
English
ISSN :
13807501
Volume :
83
Issue :
32
Database :
Complementary Index
Journal :
Multimedia Tools & Applications
Publication Type :
Academic Journal
Accession number :
179439275
Full Text :
https://doi.org/10.1007/s11042-024-18554-9