1. AusKidTalk: An Auditory-Visual Corpus of 3- to 12-Year-Old Australian Children’s Speech
- Author
- Felicity Cox, Tharmakulasingam Sirojan, Elise Baker, Kirrie J. Ballard, Beena Ahmed, Vidhyasaharan Sethu, Barbara Kelly, Katherine Demuth, Joanne Arciuli, Chloé Diskin-Holdaway, Hadi Mehmood, Titia Benders, Dominique Estival, Mostafa Shahin, Denis K Burnham, Chwee Beng Lee, Julien Epps, and Eliathamby Ambikairajah
- Subjects
Computer science, Auditory visual, Speech corpus, Speech processing, Australian English, Narrative, Artificial intelligence, User interface, Protocol (object-oriented programming), Natural language processing, Sentence
- Abstract
Here we present AusKidTalk [1], an audio-visual (AV) corpus of Australian children’s speech collected to facilitate the development of speech-based technological solutions for children. It builds upon the technology and expertise developed through the collection of an earlier corpus of Australian adult speech, AusTalk [2,3]. This multi-site initiative was established to remedy the dire shortage, in Australia and around the world, of children’s speech corpora large enough to train accurate automated speech processing tools for children. We are collecting ~600 hours of speech from children aged 3–12 years, including single-word and sentence productions as well as narrative and emotional speech. In this paper, we discuss the key requirements for AusKidTalk and how we designed the recording setup and protocol to meet them. We also discuss key findings from our feasibility study of the recording protocol, recording tools, and user interface.
- Published
- 2021