1. Overview of the sixth dialog system technology challenge: DSTC6
- Author
-
Y-Lan Boureau, Ryuichiro Higashinaka, Michimasa Inaba, Julien Perez, Chiori Hori, Yuiko Tsunomori, Takaaki Hori, Seokhwan Kim, Koichiro Yoshino, and Tetsuro Takahashi
- Subjects
Class (computer programming) ,Goal orientation ,Computer science ,media_common.quotation_subject ,Natural language generation ,020206 networking & telecommunications ,02 engineering and technology ,computer.software_genre ,01 natural sciences ,Theoretical Computer Science ,Domain (software engineering) ,Task (project management) ,Human-Computer Interaction ,Human–computer interaction ,0103 physical sciences ,0202 electrical engineering, electronic engineering, information engineering ,Conversation ,Dialog box ,Dialog system ,010301 acoustics ,computer ,Software ,media_common - Abstract
This paper describes the experimental setups and the evaluation results of the sixth Dialog System Technology Challenges (DSTC6) aiming to develop end-to-end dialogue systems. Neural network models have become a recent focus of investigation in dialogue technologies. Previous models required training data to be manually annotated with word meanings and dialogue states, but end-to-end neural network dialogue systems learn to directly output natural-language system responses without needing training data to be manually annotated. Thus, this approach allows us to scale up the size of training data and cover more dialog domains. In addition, dialogue systems require a meta-function to avoid deploying inappropriate responses generated by themselves. To challenge such issues, the DSTC6 consists of three tracks, (1). End-to-End Goal Oriented dialogue Learning to select system responses, (2). End-to-End Conversation Modeling to generate system responses using Natural Language Generation (NLG) and (3). Dialogue Breakdown Detection. Since each domain has different issues to be addressed to develop dialogue systems, we targeted restaurant retrieval dialogues to fill slot-value in Track 1, customer services on Twitter by combining goal-oriented dialogues and ChitChat in Track 2 and human-machine dialogue data for ChitChat in Track 3. DSTC6 had 141 people declaring their interests and 23 teams submitted their final results. 18 scientific papers were presented in the wrap-up workshop. We find the blending end-to-end trainable models associated to meaningful prior knowledge performs the best for the restaurant retrieval for Track 1. Indeed, Hybrid Code Network and Memory Network have been the best models for this task. In Track 2, 78.5% of the system responses automatically generated by the best system were rated better than acceptable by humans and this achieves 89% of the number of the human responses rated in the same class. In Track3, the dialogue breakdown detection technologies performed as well as human agreements, in both data-sets of English and Japanese.
- Published
- 2019
- Full Text
- View/download PDF