Efficient corpus design for wake-word detection
- Source :
- SLT
- Publication Year :
- 2021
- Publisher :
- IEEE, 2021.
- Abstract :
- Wake-word detection is an indispensable technology for preventing virtual voice agents from being unintentionally triggered. Although various neural networks have been proposed for wake-word detection, less attention has been paid to efficient corpus design, which we address in this study. To this end, we collected speech data via a crowdsourcing platform and evaluated the performance of several neural networks trained on different subsets of the corpus. The results reveal the following requirements for an efficient corpus design that yields a lower misdetection rate: (1) short segments of continuous speech can be used as negative samples, but they are not as effective as random words; (2) utterances of "adversarial" words, i.e., words phonetically similar to the wake-word, significantly improve performance when used as negative samples; (3) it is preferable for individual speakers to provide both positive and negative samples; (4) increasing the number of speakers is better than increasing the number of repetitions of the wake-word by each speaker.
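Requirement (2) depends on finding words that sound like the wake-word. The record does not say how the authors chose their adversarial words, so the following is only a minimal sketch under stated assumptions: it uses letter-level Levenshtein distance as a rough stand-in for phonetic similarity, and the wake-word "computer", the candidate list, and the function names are all hypothetical.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance between two strings, using a rolling DP row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def rank_adversarial(wake_word: str, candidates: list[str], top_k: int = 3) -> list[str]:
    """Return the top_k candidates closest to wake_word (hypothetical helper).

    Assumption: spelling distance approximates phonetic distance. A real
    pipeline would more likely compare phoneme sequences from a
    pronunciation dictionary.
    """
    others = [w for w in candidates if w != wake_word]
    # Sort by distance, then alphabetically so that ties are deterministic.
    return sorted(others, key=lambda w: (edit_distance(wake_word, w), w))[:top_k]


if __name__ == "__main__":
    words = ["commuter", "banana", "compute", "complete", "computer"]
    print(rank_adversarial("computer", words, top_k=2))
```

Words ranked near the top would be candidates for recording as negative samples; distant words such as "banana" play the role of the random-word negatives mentioned in requirement (1).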
- Subjects :
- Training set
Artificial neural network
Computer science
Speech recognition
020206 networking & telecommunications
Speech synthesis
02 engineering and technology
Crowdsourcing
Adversarial system
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
Word (computer architecture)
Natural language
Details
- Database :
- OpenAIRE
- Journal :
- 2021 IEEE Spoken Language Technology Workshop (SLT)
- Accession number :
- edsair.doi...........5bb357fbb13e1dc71567a5f62badf540
- Full Text :
- https://doi.org/10.1109/slt48900.2021.9383569