Back to Search Start Over

Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and Detector

Authors :
Wang, Hongbo
Lu, Junyu
Han, Yan
Ma, Kai
Yang, Liang
Lin, Hongfei
Publication Year :
2024

Abstract

Patronizing and Condescending Language (PCL) is a form of discriminatory toxic speech targeting vulnerable groups, threatening both online and offline safety. While toxic speech research has mainly focused on overt toxicity, such as hate speech, microaggressions in the form of PCL remain underexplored. Additionally, dominant groups' discriminatory facial expressions and attitudes toward vulnerable communities can be more impactful than verbal cues, yet these frame features are often overlooked. In this paper, we introduce the PCLMM dataset, the first Chinese multimodal dataset for PCL, consisting of 715 annotated videos from Bilibili, with high-quality PCL facial frame spans. We also propose the MultiPCL detector, featuring a facial expression detection module for PCL recognition, demonstrating the effectiveness of modality complementarity in this challenging task. Our work makes an important contribution to advancing microaggression detection within the domain of toxic speech.<br />Comment: Under review in ICASSP 2025

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2409.05005
Document Type :
Working Paper