Back to Search Start Over

End-to-end listening agent for audiovisual emotional and naturalistic interactions

Authors :
Kevin El Haddad
Yara Rizk
Louise Heron
Nadine Hajj
Yong Zhao
Jaebok Kim
Trung Ngô Trọng
Minha Lee
Marwan Doumit
Payton Lin
Yelin Kim
Hüseyin Çakmak
Source :
Journal of Science and Technology of the Arts, Vol 10, Iss 2 (2018)
Publication Year :
2018
Publisher :
Universidade Católica Portuguesa, 2018.

Abstract

In this work, we established the foundations of a framework with the goal to build an end-to-end naturalistic expressive listening agent. The project was split into modules for recognition of the user’s paralinguistic and nonverbal expressions, prediction of the agent’s reactions, synthesis of the agent’s expressions and data recordings of nonverbal conversation expressions. First, a multimodal multitask deep learning-based emotion classification system was built along with a rule-based visual expression detection system. Then several sequence prediction systems for nonverbal expressions were implemented and compared. Also, an audiovisual concatenation-based synthesis system was implemented. Finally, a naturalistic, dyadic emotional conversation database was collected. We report here the work made for each of these modules and our planned future improvements.

Details

Language :
English
ISSN :
16469798 and 21830088
Volume :
10
Issue :
2
Database :
Directory of Open Access Journals
Journal :
Journal of Science and Technology of the Arts
Publication Type :
Academic Journal
Accession number :
edsdoj.fa1ac0b904974532b6abf0ac58fa1261
Document Type :
article
Full Text :
https://doi.org/10.7559/citarj.v10i2.424