1. How to Evaluate Humorous Response Generation, Seriously?
- Author
- Vladislav Blinov, Pavel Braslavski, Katya Pertsova, and Valeria Bolotova
- Subjects
genetic structures ,Natural language user interface ,Joke ,business.industry ,media_common.quotation_subject ,Computational humor ,02 engineering and technology ,Crowdsourcing ,eye diseases ,fluids and secretions ,Variation (linguistics) ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Quality (business) ,Language proficiency ,sense organs ,Dialog box ,Psychology ,business ,Cognitive psychology ,media_common
- Abstract
Nowadays natural language user interfaces, such as chatbots and conversational agents, are very common. A desirable trait of such applications is a sense of humor. It is, therefore, important to be able to measure the quality of humorous responses. However, humor evaluation is hard since humor is highly subjective. To address this problem, we conducted an online evaluation of 30 dialog jokes from different sources by almost 300 participants, both volunteers and Mechanical Turk workers. We collected joke ratings along with participants' age, gender, and language proficiency. Results show that demographics and joke topics can partly explain variation in humor judgments. We expect that these insights will aid humor evaluation and interpretation. The findings can also be of interest for humor generation methods in conversational systems.
- Published
- 2018