1. Generalized zero-shot emotion recognition from body gestures
- Author
-
Jinting Wu, Yujia Zhang, Qianzhong Li, Shiying Sun, and Xiaoguang Zhao
- Subjects
Computer science ,business.industry ,Emotion classification ,Shot (filmmaking) ,Skeleton (category theory) ,computer.software_genre ,Zero (linguistics) ,Body language ,Artificial Intelligence ,Emotional expression ,Artificial intelligence ,business ,computer ,Classifier (UML) ,Natural language processing ,Gesture - Abstract
In human-human interaction, body language is one of the most important emotional expressions. However, each emotion category contains abundant emotional body gestures, and basic emotions used in most researches are difficult to describe complex and diverse emotional states. It is costly to collect sufficient samples of all emotional expressions, and new emotions or new body gestures that are not included in the training set may appear during testing. To address the above problems, we design a novel mechanism that treats each emotion category as a collection of multiple body gesture categories to make better use of gesture information for emotion recognition. A Generalized Zero-Shot Learning (GZSL) framework is introduced to recognize both seen and unseen body gesture categories with the help of semantic information, and emotion predictions are further provided based on the relationship between gestures and emotions. This framework consists of two branches. The first branch is a Hierarchical Prototype Network (HPN) which learns the prototypes of body gestures and uses them to calculate the emotion attentive prototypes. This branch aims to obtain predictions on samples of the seen gesture categories. The second branch is a Semantic Auto-Encoder (SAE) which utilizes semantic representations to predict samples of unseen gesture categories. Thresholds are further trained to determine which branch result will be used during testing, and the emotion labels are finally obtained from these results. Comprehensive experiments are conducted on an emotion recognition dataset which contains skeleton data of multiple body gestures, and the performance of our framework is superior to both the traditional emotion classifier and state-of-the-art zero-shot learning methods.
- Published
- 2021
- Full Text
- View/download PDF