Author: "Songlong Xing" / Journal: ieee transactions on affective computing - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Songlong Xing"' showing total 2 results

Start Over Author "Songlong Xing" Journal ieee transactions on affective computing

2 results on '"Songlong Xing"'

1. Adapted Dynamic Memory Network for Emotion Recognition in Conversation

Author: Sijie Mai, Haifeng Hu, and Songlong Xing
Subjects: Dependency (UML), Computer science, media_common.quotation_subject, Speech recognition, Process (computing), 020206 networking & telecommunications, Context (language use), 02 engineering and technology, Human-Computer Interaction, 0202 electrical engineering, electronic engineering, information engineering, Benchmark (computing), Task analysis, 020201 artificial intelligence & image processing, Conversation, Representation (mathematics), Episodic memory, Software, media_common
Abstract: In this paper, we address Emotion Recognition in Conversation (ERC) where conversational data are presented in a multimodal setting. Psychological evidence shows that self and inter-speaker influence are two central factors to emotion dynamics in conversation. State-of-the-art models do not effectively synthesise these two factors. Therefore, we propose an Adapted Dynamic Memory Network (A-DMN) where self and inter-speaker influences are modelled individually and further synthesised oriented towards the current utterance. Specifically, we model the dependency of the constituent utterances in a dialogue video using a global RNN to capture inter-speaker influence. Likewise, each speaker is assigned an RNN to capture their self influence. Afterwards, an Episodic Memory Module is devised to extract contexts for self and inter-speaker influence and synthesise them to update the memory. This process repeats itself for multiple passes until a refined representation is obtained and used for final prediction. Additionally, we explore cross-modal fusion in the context of multimodal ERC, and propose a convolution-based method which proves effective in extracting local interactions and computationally efficient. Extensive experiments demonstrate that A-DMN outperforms the state-of-the-art models on benchmark datasets.
Published: 2022

2. Multi-Fusion Residual Memory Network for Multimodal Human Sentiment Comprehension

Author: Sijie Mai, Haifeng Hu, Songlong Xing, and Jia Xu
Subjects: Focus (computing), Sequence, Modalities, Forgetting, Dependency (UML), Computer science, business.industry, Process (engineering), 020206 networking & telecommunications, 02 engineering and technology, Machine learning, computer.software_genre, Human-Computer Interaction, Comprehension, 030507 speech-language pathology & audiology, 03 medical and health sciences, 0202 electrical engineering, electronic engineering, information engineering, State (computer science), Artificial intelligence, 0305 other medical science, business, computer, Software
Abstract: Multimodal human sentiment comprehension refers to recognizing human affection from multiple modalities. There exist two key issues for this problem. Firstly, it is difficult to explore time-dependent interactions between modalities and focus on the important time steps. Secondly, processing the long fused sequence of utterances is susceptible to the forgetting problem due to the long-term temporal dependency. In this paper, we introduce a hierarchical learning architecture to classify utterance-level sentiment. To address the first issue, we perform time-step level fusion to generate fused features for each time step, which explicitly models time-restricted interactions by incorporating information across modalities at the same time step. Furthermore, based on the assumption that acoustic features directly reflect emotional intensity, we pioneer emotion intensity attention to focus on the time steps where emotion changes or intense affections take place. To handle the second issue, we propose Residual Memory Network (RMN) to process the fused sequence. RMN utilizes some techniques such as directly passing the previous state into the next time step, which helps to retain the information from many time steps ago. We show that our method achieves state-of-the-art performance on multiple datasets. Results also suggest that RMN yields competitive performance on sequence modeling tasks.
Published: 2022

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

2 results on '"Songlong Xing"'

1. Adapted Dynamic Memory Network for Emotion Recognition in Conversation

2. Multi-Fusion Residual Memory Network for Multimodal Human Sentiment Comprehension

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Database

2 results on '"Songlong Xing"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources