Search

Your search for "Wang, Yujun" returned 61 results

Search Constraints

Author: "Wang, Yujun"; Publication Type: Reports
Search Results

1. Optimizing Dysarthria Wake-Up Word Spotting: An End-to-End Approach for SLT 2024 LRDWWS Challenge

2. Efficient Extraction of Noise-Robust Discrete Units from Self-Supervised Speech Models

3. Enhancing Automated Audio Captioning via Large Language Models with Optimized Audio Encoding

4. Bridging Language Gaps in Audio-Text Retrieval

5. Scaling up masked audio encoder learning for general audio classification

6. Towards Expressive Zero-Shot Speech Synthesis with Hierarchical Prosody Modeling

7. CED: Consistent ensemble distillation for audio tagging

8. Enhanced Neural Beamformer with Spatial Information for Target Speech Extraction

9. Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information

10. AV-SepFormer: Cross-Attention SepFormer for Audio-Visual Target Speaker Extraction

11. Understanding temporally weakly supervised training: A case study for keyword spotting

12. Streaming Audio Transformers for Online Audio Tagging

13. Exploring Representation Learning for Small-Footprint Keyword Spotting

14. Relate auditory speech to EEG by shallow-deep attention-based network

15. Improving Weakly Supervised Sound Event Detection with Causal Intervention

16. Unified Keyword Spotting and Audio Tagging on Mobile Devices with Transformers

17. Improve Bilingual TTS Using Dynamic Language and Phonology Embedding

18. An empirical study of weakly supervised audio tagging embeddings for general audio representations

19. UniKW-AT: Unified Keyword Spotting and Audio Tagging

20. Pseudo strong labels for large scale weakly supervised audio tagging

21. Learning Decoupling Features Through Orthogonality Regularization

22. Detect what you want: Target Sound Detection

23. Improving Emotional Speech Synthesis by Using SUS-Constrained VAE and Text Encoder Aggregation

24. PAMA-TTS: Progression-Aware Monotonic Attention for Stable Seq2Seq TTS With Accurate Phoneme Duration Control

25. A Separable Temporal Convolution Neural Network with Attention for Small-Footprint Keyword Spotting

26. Separable Temporal Convolution plus Temporally Pooled Attention for Lightweight High-performance Keyword Spotting

27. Multi-channel Speech Enhancement with 2-D Convolutional Time-frequency Domain Features and a Pre-trained Acoustic Model

28. Msdtron: a high-capability multi-speaker speech synthesis system for diverse data using characteristic information

29. GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio

30. speechocean762: An Open-Source Non-native English Speech Corpus For Pronunciation Assessment

31. Multi-Channel Automatic Speech Recognition Using Deep Complex Unet

32. Data Augmentation For Children's Speech Recognition -- The 'Ethiopian' System For The SLT 2021 Children Speech Recognition Challenge

33. AutoKWS: Keyword Spotting with Differentiable Architecture Search

34. Exploiting Deep Sentential Context for Expressive End-to-End Speech Synthesis

35. RawNet: Fast End-to-End Neural Vocoder

36. End-to-end Models with auditory attention in Multi-channel Keyword Spotting

37. Sequence-to-sequence Models for Small-Footprint Keyword Spotting

38. Attention-based End-to-End Models for Small-Footprint Keyword Spotting

39. Empirical Evaluation of Speaker Adaptation on DNN based Acoustic Model

40. Investigating Generative Adversarial Networks based Speech Dereverberation for Robust Speech Recognition

41. Attention-Based End-to-End Speech Recognition on Voice Search

42. State-to-state chemistry at ultra-low temperature

43. Role of the intraspecies scattering length in the Efimov scenario with large mass difference

44. Isotopic shift of atom-dimer Efimov resonances in K-Rb mixtures: Critical effect of multichannel Feshbach physics

45. Heteronuclear Efimov scenario with positive intraspecies scattering length

46. Few-body physics of ultracold atoms and molecules with long-range interactions

47. Universal van der Waals Physics for Three Ultracold Atoms

48. Ultracold mixtures of atomic Li-6 and Cs-133 with tunable interactions

49. Universal three-body recombination via resonant d-wave interactions

50. Universal three-body parameter in heteronuclear atomic systems
