Back to Search
Start Over
How does that name sound? Name representation learning using accent-specific speech generation.
- Source :
-
Knowledge-Based Systems . Sep2021, Vol. 227, pN.PAG-N.PAG. 1p. - Publication Year :
- 2021
-
Abstract
- Searching for information about a specific person is a frequent online activity. In most cases, users are aided in the search process by queries containing a name in Web search engines. Typically, Web search engines provide just a few accurate results associated with a name-containing query. Most existing solutions for suggesting synonyms in online search are based on pattern matching and phonetic encoding, however very often, the performance of such solutions is less than optimal. In this paper, we propose SpokenName2Vec , a novel and generic algorithm which addresses the synonym suggestion problem by utilizing automated speech generation, and deep learning to produce novel spoken name embeddings. These embeddings capture the way people pronounce names in a particular language and accent. Utilizing a name's pronunciation can help detect names that sound alike, but are written differently. We demonstrated the proposed approach on a large-scale dataset with more than 250,000 forenames and surnames and evaluated it on two ground truth datasets containing 7400 forenames and 25,000 surnames (including their verified synonyms). The performance of SpokenName2Vec was found superior to the 10 other algorithms evaluated, including phonetic encoding, string similarity, and machine learning algorithms. The results obtained emphasize the potential of spoken name embeddings for improved synonym suggestion. • Proposing SpokenName2Vec, a novel and generic algorithm which addresses the synonym suggestion problem by utilizing automated speech generation and deep learning to produce spoken name embeddings. • Behind the Name dataset: In total, 37,916 synonyms were retrieved for the 7,399 distinct names. • Spoken Name dataset: 250K WAV files associated with the names in the dataset for 11 languages. • A demonstration of the quality of SpokenName2Vec on forenames and surnames, including a comparison to other 10 algorithms. [ABSTRACT FROM AUTHOR]
- Subjects :
- *WEB search engines
*DEEP learning
*MACHINE learning
*ALGORITHMS
*PERSONAL names
Subjects
Details
- Language :
- English
- ISSN :
- 09507051
- Volume :
- 227
- Database :
- Academic Search Index
- Journal :
- Knowledge-Based Systems
- Publication Type :
- Academic Journal
- Accession number :
- 151556966
- Full Text :
- https://doi.org/10.1016/j.knosys.2021.107229