1. The Technique of English Word Syllable Division in Speech Synthesis Based on Neural Network.
- Author
-
Ma, Hongjuan
- Subjects
- *
SPEECH synthesis , *STRESS (Linguistics) , *SAMPLING errors , *PROBLEM solving - Abstract
With the increasing maturity of speech synthesis technology, on the one hand, it has been more and more widely used in people's lives; on the other hand, it also brings more and more convenience to people. The requirements for speech synthesis systems are getting higher and higher. Therefore, advanced technology is used to improve and update the accent recognition system. This paper mainly introduces the word stress annotation technology combined with neural network speech synthesis technology. In Chinese speech synthesis, prosodic structure prediction has a great influence on naturalness. The purpose of this paper is to accurately predict the prosodic structure, which has become an important problem to be solved in speech synthesis. Experimental data show that the average error of samples in the network training process is lel/85, and the minimum value of the training error after 500 steps is 0.00013127, so the final sample average error is lel = 85 ∗ 0.0013127 = 0.112 < 0.5, and use the deep neural network (DNN) to train different parameters to obtain the conversion model, and then synthesize these conversion models, and finally achieve the effect of improving the synthesized sound quality. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF