Search

Your search keyword '"Liu, Shujie"' showing total 97 results

Search Constraints

Start Over You searched for: Author "Liu, Shujie" Remove constraint Author: "Liu, Shujie" Topic computer science - computation and language Remove constraint Topic: computer science - computation and language
97 results on '"Liu, Shujie"'

Search Results

1. Autoregressive Speech Synthesis without Vector Quantization

2. VALL-E R: Robust and Efficient Zero-Shot Text-to-Speech Synthesis via Monotonic Alignment

3. VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers

4. TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation

5. CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations

6. RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting for Text-to-Speech Synthesis

7. WavLLM: Towards Robust and Adaptive Speech Large Language Model

8. Boosting Large Language Model for Speech Synthesis: An Empirical Study

9. COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning

10. WavMark: Watermarking for Audio Generation

11. SpeechX: Neural Codec Language Model as a Versatile Speech Transformer

12. On decoder-only architecture for speech-to-text and large language model integration

13. Accelerating Transducers through Adjacent Token Merging

14. Prompting Large Language Models for Zero-Shot Domain Adaptation in Speech Recognition

15. VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation

16. ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation

17. Code-Switching Text Generation and Injection in Mandarin-English ASR

18. Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling

19. Building High-accuracy Multilingual ASR with Gated Language Experts and Curriculum Training

20. Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers

21. BEATs: Audio Pre-Training with Acoustic Tokenizers

22. VATLM: Visual-Audio-Text Pre-Training with Unified Masked Prediction for Speech Representation Learning

23. LongFNT: Long-form Speech Recognition with Factorized Neural Transducer

24. LAMASSU: Streaming Language-Agnostic Multilingual Speech Recognition and Translation Using Neural Transducers

25. Joint Pre-Training with Speech and Bilingual Text for Direct Speech to Speech Translation

26. SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training

27. SpeechLM: Enhanced Speech Pre-Training with Unpaired Textual Data

28. Supervision-Guided Codebooks for Masked Prediction in Speech Pre-training

29. The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task

30. Ultra Fast Speech Separation Model with Teacher Student Learning

31. Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition?

32. Self-Supervised Learning for speech recognition with Intermediate layer supervision

33. Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction

34. WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing

35. SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing

36. UniSpeech-SAT: Universal Speech Representation Learning with Speaker Aware Pre-Training

37. Jointly Learning to Repair Code and Generate Commit Message

38. Knowledge Enhanced Fine-Tuning for Better Handling Unseen Entities in Dialogue Generation

39. A Configurable Multilingual Model is All You Need to Recognize All Languages

40. CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation

41. UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data

42. Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer

43. Developing Real-time Streaming Transformer Transducer for Speech Recognition on Large-scale Dataset

44. CodeBLEU: a Method for Automatic Evaluation of Code Synthesis

45. GraphCodeBERT: Pre-training Code Representations with Data Flow

46. Continuous Speech Separation with Conformer

47. On the Comparison of Popular End-to-End Models for Large Scale Speech Recognition

48. Curriculum Pre-training for End-to-End Speech Translation

49. MuTual: A Dataset for Multi-Turn Dialogue Reasoning

50. Semantic Mask for Transformer based End-to-End Speech Recognition

Catalog

Books, media, physical & digital resources