Back to Search Start Over

Segregated Temporal Assembly Recurrent Networks for Weakly Supervised Multiple Action Detection

Authors :
Fei Wu
Yunlu Xu
Shiliang Pu
Yi Niu
Jianwen Xie
Chengwei Zhang
Zhanzhan Cheng
Source :
AAAI
Publication Year :
2019
Publisher :
Association for the Advancement of Artificial Intelligence (AAAI), 2019.

Abstract

This paper proposes a segregated temporal assembly recurrent (STAR) network for weakly-supervised multiple action detection. The model learns from untrimmed videos with only supervision of video-level labels and makes prediction of intervals of multiple actions. Specifically, we first assemble video clips according to class labels by an attention mechanism that learns class-variable attention weights and thus helps the noise relieving from background or other actions. Secondly, we build temporal relationship between actions by feeding the assembled features into an enhanced recurrent neural network. Finally, we transform the output of recurrent neural network into the corresponding action distribution. In order to generate more precise temporal proposals, we design a score term called segregated temporal gradient-weighted class activation mapping (ST-GradCAM) fused with attention weights. Experiments on THUMOS'14 and ActivityNet1.3 datasets show that our approach outperforms the state-of-the-art weakly-supervised method, and performs at par with the fully-supervised counterparts.<br />Comment: Accepted to Proc. AAAI Conference on Artificial Intelligence 2019

Details

ISSN :
23743468 and 21595399
Volume :
33
Database :
OpenAIRE
Journal :
Proceedings of the AAAI Conference on Artificial Intelligence
Accession number :
edsair.doi.dedup.....96d31175de9a9dbc7d4f6b0a8c4fe07a
Full Text :
https://doi.org/10.1609/aaai.v33i01.33019070