Back to Search Start Over

Fusing Affective Dimensions and Audio-Visual Features from Segmented Video for Depression Recognition

Authors :
David Pinto-Avedaño
Luis Villaseñor-Pineda
Manuel Montes-y-Gómez
Veronica Reyez-Meza
Hugo Jair Escalante
Humberto Pérez Espinosa
Source :
AVEC@MM
Publication Year :
2014
Publisher :
ACM, 2014.

Abstract

Depression is a disease that affects a considerable portion of the world population. Severe cases of depression interfere with the common live of patients, for those patients a strict monitoring is necessary in order to control the progress of the disease and to prevent undesired side effects. A way to keep track of patients with depression is by means of online monitoring via human-computer-interaction. The AVEC'14 challenge aims at developing technology towards the online monitoring of depression patients. This paper describes an approach to depression recognition from audiovisual information in the context of the AVEC'14 challenge. The proposed method relies on an effective voice segmentation procedure, followed by segment-level feature extraction and aggregation. Finally, a meta-model is trained to fuse mono-modal information. The main novel features of our proposal are that (1) we use affective dimensions for building depression recognition models; (2) we extract visual information from voice and silence segments separately; (3) we consolidate features and use a meta-model for fusion. The proposed methodology is evaluated, experimental results reveal the method is competitive.

Details

Database :
OpenAIRE
Journal :
Proceedings of the 4th International Workshop on Audio/Visual Emotion Challenge
Accession number :
edsair.doi...........1fb28871603166fdb8f46a26729213fb
Full Text :
https://doi.org/10.1145/2661806.2661815