Multi-scale spatialtemporal information deep fusion network with temporal pyramid mechanism for video action recognition.

Authors :
Ou, Hongshi
Sun, Jifeng
Source :
Journal of Intelligent & Fuzzy Systems. 2021, Vol. 41 Issue 4, p4533-4545. 13p.
Publication Year :
2021

Abstract

In deep learning-based video action recognition, the neural network must capture spatial information, motion information, and the correlation between the two over uneven time spans. This paper proposes a network that extracts semantic information from video sequences through deep fusion of local spatial-temporal information. The network uses a 2D Convolutional Neural Network (2DCNN) and a Multi Spatial-Temporal scale 3D Convolutional Neural Network (MST_3DCNN) to extract spatial information and motion information, respectively. Spatial and motion information from the same time segment are fused by 3D convolution to generate the spatial-temporal information of a given moment. The spatial-temporal information of multiple single moments then enters a Temporal Pyramid Net (TPN), which generates local spatial-temporal information at multiple time scales. Finally, a bidirectional recurrent neural network is applied to the spatial-temporal information of all parts to acquire context information spanning the entire video, which gives the network the ability to extract video context information. In experiments on three common video action recognition data sets, UCF101, UCF11, and UCFSports, the proposed spatial-temporal information deep fusion network achieves a high recognition accuracy. [ABSTRACT FROM AUTHOR]
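
The abstract does not say how the Temporal Pyramid Net is built, so the following is only an illustrative sketch of the general idea of producing features at multiple time scales from per-moment features. It assumes plain average pooling over non-overlapping windows, which is a simplification and not the authors' method; the names `mean_pool`, `temporal_pyramid`, and `levels`, and the toy input values, are hypothetical.

```python
from typing import List

Vector = List[float]


def mean_pool(frames: List[Vector]) -> Vector:
    """Element-wise mean of equal-length feature vectors."""
    n = len(frames)
    return [sum(column) / n for column in zip(*frames)]


def temporal_pyramid(features: List[Vector], levels: int = 3) -> List[List[Vector]]:
    """Pool per-moment features at several temporal scales.

    Assumption: level k averages non-overlapping windows of 2**k moments.
    A trailing partial window is averaged over the moments it contains.
    """
    pyramid = []
    for k in range(levels):
        window = 2 ** k
        pooled = [
            mean_pool(features[i:i + window])
            for i in range(0, len(features), window)
        ]
        pyramid.append(pooled)
    return pyramid


# Toy input: 4 moments, each a 2-dimensional fused feature (made-up values).
moments = [[1.0, 0.0], [3.0, 2.0], [5.0, 4.0], [7.0, 6.0]]
pyramid = temporal_pyramid(moments, levels=3)
```

In this sketch, level 0 keeps the four per-moment features, level 1 summarizes pairs of moments, and level 2 summarizes the whole clip. A learned pyramid would replace the fixed averaging with trainable layers, and the outputs of all levels would then feed the bidirectional recurrent stage described in the abstract.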

Details

Language :
English
ISSN :
1064-1246
Volume :
41
Issue :
4
Database :
Academic Search Index
Journal :
Journal of Intelligent & Fuzzy Systems
Publication Type :
Academic Journal
Accession number :
153410615
Full Text :
https://doi.org/10.3233/JIFS-189714