1. Scalable Identity-Oriented Speech Retrieval
- Author
- Rongzhong Lian, Lixin Fan, Chen Zhang, Jinhua Peng, Chaotao Chen, Yawen Li, Lei Chen, and Di Jiang
- Subjects
Computer science, business.industry, Search engine indexing, Speech retrieval, Financial risk management, Snippet, computer.software_genre, Computer Science Applications, Task (project management), Computational Theory and Mathematics, Scalable system, Scalability, Identity (object-oriented programming), Artificial intelligence, business, computer, Natural language processing, Information Systems
- Abstract
With the prevalence of voice devices in our daily life, speech data is accumulating at an unprecedented speed. This vast amount of speech data forms an invaluable database for security surveillance and financial risk management. However, speech collected from different sources is not necessarily annotated with the speaker's identity, which makes retrieving all the speech records for a given identity extremely challenging. In this paper, we propose a scalable system for Identity-Oriented Speech Retrieval (IO-SR), which seamlessly integrates speaker modeling and deep indexing techniques. Given a speech snippet and a speech database, IO-SR efficiently retrieves all speech snippets uttered by the same speaker as the given one. Evaluations on an industrial dataset containing millions of speech snippets show that our system achieves superior performance compared with state-of-the-art methods.
- Published
- 2023