Back to Search
Start Over
Weakly supervised temporal action localization: a survey.
- Source :
- Multimedia Tools & Applications; Sep2024, Vol. 83 Issue 32, p78361-78386, 26p
- Publication Year :
- 2024
-
Abstract
- Temporal action localization (TAL) is one of the most important tasks in video understanding. Weakly supervised temporal action localization (WTAL) involves classifying and localizing all the action instances in untrimmed videos under the supervision of only video-level category labels, which is a challenging task because of the absence of frame-level annotations. In this study, first, we review the development process of the WTAL task in recent years, summarize and analyze the main problems of WTAL. Second, we classify and compare the research approaches of existing models and thoroughly discuss methods based on multiple instance learning (MIL), feature erasing, the attention mechanism, similarity propagation, pseudo-ground truth generation, contrastive learning, and adversarial learning. Then, we present the datasets and evaluation criteria for the WTAL task. Finally, we discuss the main application areas and further developments in WTAL. [ABSTRACT FROM AUTHOR]
- Subjects :
- DEEP learning
VIDEOS
ANNOTATIONS
Subjects
Details
- Language :
- English
- ISSN :
- 13807501
- Volume :
- 83
- Issue :
- 32
- Database :
- Complementary Index
- Journal :
- Multimedia Tools & Applications
- Publication Type :
- Academic Journal
- Accession number :
- 179439275
- Full Text :
- https://doi.org/10.1007/s11042-024-18554-9