Search

Your search keyword '"Liu, Shujie"' showing total 589 results

Search Constraints

Start Over You searched for: Author "Liu, Shujie" Remove constraint Author: "Liu, Shujie" Search Limiters Available in Library Collection Remove constraint Search Limiters: Available in Library Collection
589 results on '"Liu, Shujie"'

Search Results

1. Autoregressive Speech Synthesis without Vector Quantization

2. VALL-E R: Robust and Efficient Zero-Shot Text-to-Speech Synthesis via Monotonic Alignment

3. VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers

4. TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation

5. CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations

6. RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting for Text-to-Speech Synthesis

7. WavLLM: Towards Robust and Adaptive Speech Large Language Model

8. Advanced Long-Content Speech Recognition With Factorized Neural Transducer

9. Boosting Large Language Model for Speech Synthesis: An Empirical Study

10. COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning

12. Diffusion Conditional Expectation Model for Efficient and Robust Target Speech Extraction

13. WavMark: Watermarking for Audio Generation

14. SpeechX: Neural Codec Language Model as a Versatile Speech Transformer

15. On decoder-only architecture for speech-to-text and large language model integration

16. OpenNDD: Open Set Recognition for Neurodevelopmental Disorders Detection

17. Accelerating Transducers through Adjacent Token Merging

18. Prompting Large Language Models for Zero-Shot Domain Adaptation in Speech Recognition

19. VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation

20. ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation

22. Code-Switching Text Generation and Injection in Mandarin-English ASR

23. Target Sound Extraction with Variable Cross-modality Clues

24. Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling

25. Building High-accuracy Multilingual ASR with Gated Language Experts and Curriculum Training

26. Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers

27. BEATs: Audio Pre-Training with Acoustic Tokenizers

28. VATLM: Visual-Audio-Text Pre-Training with Unified Masked Prediction for Speech Representation Learning

29. Exploring WavLM on Speech Enhancement

30. LongFNT: Long-form Speech Recognition with Factorized Neural Transducer

31. LAMASSU: Streaming Language-Agnostic Multilingual Speech Recognition and Translation Using Neural Transducers

32. Two-Stream Network for Sign Language Recognition and Translation

33. Joint Pre-Training with Speech and Bilingual Text for Direct Speech to Speech Translation

34. SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training

35. SpeechLM: Enhanced Speech Pre-Training with Unpaired Textual Data

36. Supervision-Guided Codebooks for Masked Prediction in Speech Pre-training

37. The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task

38. Ultra Fast Speech Separation Model with Teacher Student Learning

39. Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition?

40. Speech Pre-training with Acoustic Piece

41. Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data

45. Self-Supervised Learning for speech recognition with Intermediate layer supervision

46. Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction

47. WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing

48. Separating Long-Form Speech with Group-Wise Permutation Invariant Training

49. Optimizing Alignment of Speech and Language Latent Spaces for End-to-End Speech Recognition and Understanding

50. SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing

Catalog

Books, media, physical & digital resources