Search for "Guo, Yiwei": 20 results

Search Constraints

Author: "Guo, Yiwei"
Topic: computer science - sound

Search Results

1. Fast and High-Quality Auto-Regressive Speech Synthesis via Speculative Decoding

2. LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec

3. vec2wav 2.0: Advancing Voice Conversion via Discrete Token Vocoders

4. DiveSound: LLM-Assisted Automatic Taxonomy Construction for Diverse Audio Generation

5. On the Effectiveness of Acoustic BPE in Decoder-Only TTS

6. Attention-Constrained Inference for Robust Decoder-Only Text-to-Speech

7. StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations

8. VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech

9. SEF-VC: Speaker Embedding Free Zero-Shot Voice Conversion with Cross Attention

10. Expressive TTS Driven by Natural Language Prompts Using Few Human Annotations

11. Acoustic BPE for Speech Generation with Discrete Tokens

12. Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition

13. VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching

14. DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech

15. UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding

16. Multi-Speaker Multi-Lingual VQTTS System for LIMMITS 2023 Challenge

17. DiffVoice: Text-to-Speech with Latent Diffusion

18. EmoDiff: Intensity Controllable Emotional Text-to-Speech with Soft-Label Guidance

19. VQTTS: High-Fidelity Text-to-Speech Synthesis with Self-Supervised VQ Acoustic Feature

20. Unsupervised word-level prosody tagging for controllable speech synthesis
