Back to Search
Start Over
Analysis of Speaker Diarization Based on Bayesian HMM With Eigenvoice Priors
- Source :
- IEEE/ACM Transactions on Audio, Speech, and Language Processing. 28:355-368
- Publication Year :
- 2020
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2020.
-
Abstract
- In our previous work, we introduced our Bayesian Hidden Markov Model with eigenvoice priors, which has been recently recognized as the state-of-the-art model for Speaker Diarization. In this article we present a more complete analysis of the Diarization system. The inference of the model is fully described and derivations of all update formulas are provided for a complete understanding of the algorithm. An extensive analysis on the effect, sensitivity and interactions of all model parameters is provided, which might be used as a guide for their optimal setting. The newly introduced speaker regularization coefficient allows us to control the number of speakers inferred in an utterance. A naive speaker model merging strategy is also presented, which allows to drive the variational inference out of local optima. Experiments for the different diarization scenarios are presented on CALLHOME and DIHARD datasets.
- Subjects :
- Acoustics and Ultrasonics
Computer science
Speech recognition
Bayesian probability
Probabilistic logic
Inference
Speech processing
Speaker diarisation
030507 speech-language pathology & audiology
03 medical and health sciences
Computational Mathematics
Local optimum
Computer Science::Sound
Prior probability
Computer Science (miscellaneous)
Electrical and Electronic Engineering
0305 other medical science
Hidden Markov model
Subjects
Details
- ISSN :
- 23299304 and 23299290
- Volume :
- 28
- Database :
- OpenAIRE
- Journal :
- IEEE/ACM Transactions on Audio, Speech, and Language Processing
- Accession number :
- edsair.doi...........a9d4b962a347760a3cfe2fbe701b2c82
- Full Text :
- https://doi.org/10.1109/taslp.2019.2955293