Back to Search
Start Over
Prototypical Contrast and Reverse Prediction: Unsupervised Skeleton Based Action Recognition
- Source :
- IEEE Transactions on Multimedia. 25:624-634
- Publication Year :
- 2023
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2023.
-
Abstract
- We focus on unsupervised representation learning for skeleton based action recognition. Existing unsupervised approaches usually learn action representations by motion prediction but they lack the ability to fully learn inherent semantic similarity. In this paper, we propose a novel framework named Prototypical Contrast and Reverse Prediction (PCRP) to address this challenge. Different from plain motion prediction, PCRP performs reverse motion prediction based on encoder-decoder structure to extract more discriminative temporal pattern, and derives action prototypes by clustering to explore the inherent action similarity within the action encoding. Specifically, we regard action prototypes as latent variables and formulate PCRP as an expectation-maximization (EM) task. PCRP iteratively runs (1) E-step as to determine the distribution of action prototypes by clustering action encoding from the encoder while estimating concentration around prototypes, and (2) M-step as optimizing the model by minimizing the proposed ProtoMAE loss, which helps simultaneously pull the action encoding closer to its assigned prototype by contrastive learning and perform reverse motion prediction task. Besides, the sorting can also serve as a temporal task similar as reverse prediction in the proposed framework. Extensive experiments on N-UCLA, NTU 60, and NTU 120 dataset present that PCRP outperforms main stream unsupervised methods and even achieves superior performance over many supervised methods.
- Subjects :
- Computer science
business.industry
Sorting
Pattern recognition
Computer Science Applications
Semantic similarity
Discriminative model
Action (philosophy)
Encoding (memory)
Signal Processing
Media Technology
Artificial intelligence
Electrical and Electronic Engineering
business
Cluster analysis
Encoder
Feature learning
Subjects
Details
- ISSN :
- 19410077 and 15209210
- Volume :
- 25
- Database :
- OpenAIRE
- Journal :
- IEEE Transactions on Multimedia
- Accession number :
- edsair.doi...........537911953d33a2021f7f9394d7f0fea0
- Full Text :
- https://doi.org/10.1109/tmm.2021.3129616