Back to Search Start Over

Correct Pronunciation Detection of the Arabic Alphabet Using Deep Learning

Authors :
Nishmia Ziafat
Hafiz Farooq Ahmad
Iram Fatima
Muhammad Zia
Abdulaziz Alhumam
Kashif Rajpoot
Source :
Applied Sciences, Vol 11, Iss 6, p 2508 (2021)
Publication Year :
2021
Publisher :
MDPI AG, 2021.

Abstract

Automatic speech recognition for Arabic has its unique challenges and there has been relatively slow progress in this domain. Specifically, Classic Arabic has received even less research attention. The correct pronunciation of the Arabic alphabet has significant implications on the meaning of words. In this work, we have designed learning models for the Arabic alphabet classification based on the correct pronunciation of an alphabet. The correct pronunciation classification of the Arabic alphabet is a challenging task for the research community. We divide the problem into two steps, firstly we train the model to recognize an alphabet, namely Arabic alphabet classification. Secondly, we train the model to determine its quality of pronunciation, namely Arabic alphabet pronunciation classification. Due to the less availability of audio data of this kind, we had to collect audio data from the experts, and novices for our model’s training. To train these models, we extract pronunciation features from audio data of the Arabic alphabet using mel-spectrogram. We have employed a deep convolution neural network (DCNN), AlexNet with transfer learning, and bidirectional long short-term memory (BLSTM), a type of recurrent neural network (RNN), for the classification of the audio data. For alphabet classification, DCNN, AlexNet, and BLSTM achieve an accuracy of 95.95%, 98.41%, and 88.32%, respectively. For Arabic alphabet pronunciation classification, DCNN, AlexNet, and BLSTM achieve an accuracy of 97.88%, 99.14%, and 77.71%, respectively.

Details

Language :
English
ISSN :
20763417
Volume :
11
Issue :
6
Database :
Directory of Open Access Journals
Journal :
Applied Sciences
Publication Type :
Academic Journal
Accession number :
edsdoj.332b245e11b14ab6befecbd4610f3244
Document Type :
article
Full Text :
https://doi.org/10.3390/app11062508