Search

Your search for "Liu, Shujie" returned 23 results

Search Constraints

Author: "Liu, Shujie"
Topic: audio and speech processing (eess.AS)

Search Results

1. On decoder-only architecture for speech-to-text and large language model integration

2. Accelerating Transducers through Adjacent Token Merging

3. Prompting Large Language Models for Zero-Shot Domain Adaptation in Speech Recognition

4. Target Sound Extraction with Variable Cross-Modality Clues

5. Joint Pre-Training with Speech and Bilingual Text for Direct Speech to Speech Translation

6. VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation

7. Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers

8. ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation

9. Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling

10. Building High-accuracy Multilingual ASR with Gated Language Experts and Curriculum Training

11. BEATs: Audio Pre-Training with Acoustic Tokenizers

12. SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training

13. The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task

14. Optimizing Alignment of Speech and Language Latent Spaces for End-To-End Speech Recognition and Understanding

15. LAMASSU: A Streaming Language-Agnostic Multilingual Speech Recognition and Translation Model Using Neural Transducers

16. SpeechLM: Enhanced Speech Pre-Training with Unpaired Textual Data

17. LongFNT: Long-form Speech Recognition with Factorized Neural Transducer

18. Self-Supervised Learning for speech recognition with Intermediate layer supervision

19. UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data

20. A Configurable Multilingual Model is All You Need to Recognize All Languages

21. UniSpeech at scale: An Empirical Study of Pre-training Method on Large-Scale Speech Recognition Dataset

22. Large-scale Self-Supervised Speech Representation Learning for Automatic Speaker Verification

23. Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer
