Back to Search
Start Over
Optimal trained artificial neural network for Telugu speaker diarization
- Source :
- Evolutionary Intelligence. 13:631-648
- Publication Year :
- 2020
- Publisher :
- Springer Science and Business Media LLC, 2020.
-
Abstract
- Speaker indexing or diarization is the process of automatically partitioning the conversation involving multiple speakers into homogeneous segments and grouping together all the segments that correspond to the same speaker. So far, certain works have been done under this aspect; still, the need of accurate partitioning process gets lagged under certain criteria. With this in mind, this paper aims to introduce a new speaker indexing or diarization model (Telugu language) that initially involves Mel Frequency Cepstral coefficient based feature extraction. Subsequently, a new Optimized Artificial Neural Network (ANN) is introduced for clustering process. The novelty behind the clustering process is: the training of ANN takes place through optimization logic that updates the weight of ANN by a hybrid concept of Artificial Bee Colony (ABC) and Lion Algorithm (LA). Thereby, the proposed model is named as ANN-ABC-LA model. Finally, the performance of the proposed ANN-ABC-LA model is compared over the state-of-the-art models with respect to different performance measures.
- Subjects :
- Artificial neural network
Computer science
Cognitive Neuroscience
Speech recognition
Search engine indexing
Feature extraction
Process (computing)
020206 networking & telecommunications
02 engineering and technology
Telugu
language.human_language
Speaker diarisation
Mathematics (miscellaneous)
Artificial Intelligence
0202 electrical engineering, electronic engineering, information engineering
language
020201 artificial intelligence & image processing
Computer Vision and Pattern Recognition
Mel-frequency cepstrum
Cluster analysis
Subjects
Details
- ISSN :
- 18645917 and 18645909
- Volume :
- 13
- Database :
- OpenAIRE
- Journal :
- Evolutionary Intelligence
- Accession number :
- edsair.doi...........694c2a7e73b74f3347257dedfd146ac9
- Full Text :
- https://doi.org/10.1007/s12065-020-00378-9