Back to Search Start Over

Development of a regional voice dataset and speaker classification based on machine learning.

Authors :
Ismail, Muhammad
Memon, Shahzad
Dhomeja, Lachhman Das
Shah, Shahid Munir
Hussain, Dostdar
Rahim, Sabit
Ali, Imran
Source :
Journal of Big Data; 3/2/2021, Vol. 8 Issue 1, p1-18, 18p
Publication Year :
2021

Abstract

At present, voice biometrics are commonly used for identification and authentication of users through their voice. Voice based services such as mobile banking, access to personal devices, and logging into social networks are the common examples of authenticating users through voice biometrics. In Pakistan, voice-based services are very common in banking and mobile/cellular sector, however, these services do not use voice features to recognize customers. Therefore, the chance to use these services with false identity is always high. It is essential to design a voice-based recognition system to minimize the risk of false identity. In this paper, we developed regional voice datasets for voice biometrics, by collecting voice data in different local accents of Pakistan. Although, there is a global need for voice biometrics especially when voice-based services are common, however, this paper uses Pakistan as a use case to show how to build regional voice dataset for voice biometrics. To build voice dataset, voice samples were recorded from 180 male and female speakers with two languages English and Urdu in form of five regional accents. Mel Frequency Cepstral Coefficient (MFCC) features were extracted from the collected voice samples to train Support Vector Machine (SVM), Artificial Neural Network (ANN), Random Forest (RF) and K-nearest neighbor (KNN) classifiers. The results indicate that ANN outperformed SVM, RF and KNN by achieving 88.53% and 86.58% recognition accuracy on both datasets respectively. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
21961115
Volume :
8
Issue :
1
Database :
Complementary Index
Journal :
Journal of Big Data
Publication Type :
Academic Journal
Accession number :
149049357
Full Text :
https://doi.org/10.1186/s40537-021-00435-9