Search

Your search for "Meng, Zhong" returned 852 results.

Search Constraints

Author: "Meng, Zhong"

Search Results

1. Speech Prefix-Tuning with RNNT Loss for Improving LLM Predictions

2. Text Injection for Neural Contextual Biasing

3. Efficiently Train ASR Models that Memorize Less and Perform Better with Per-core Clipping

4. Deferred NAM: Low-latency Top-K Context Injection via Deferred Context Encoding for Non-Streaming ASR

5. Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models

6. SLM: Bridge the thin gap between speech and text foundation models

7. Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm

8. Massive End-to-end Models for Short Search Queries

9. Augmenting conformers with structured state-space sequence models for online speech recognition

10. Text Injection for Capitalization and Turn-Taking Prediction in Speech Models

11. Improving Joint Speech-Text Representations Without Alignment

13. Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages

14. JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition

15. Modular Hybrid Autoregressive Transducer

16. Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings

18. Streaming Multi-Talker ASR with Token-Level Serialized Output Training

19. Continuous Speech Separation with Recurrent Selective Attention Network

20. Separating Long-Form Speech with Group-Wise Permutation Invariant Training

21. Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition

22. Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR

23. Factorized Neural Transducer for Efficient Language Model Adaptation

25. A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio

26. Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition

29. End-to-End Speaker-Attributed ASR with Transformer

30. Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone

31. Continuous Speech Separation with Ad Hoc Microphone Arrays

32. Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition

33. Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings

34. The Immunomodulatory Function of Assembled Composite Nanopolypeptide Containing Bursal-Derived BP7 (CNPB7) in Promoting the Mucosal Immune Response within Poultry Immunization

35. Exploring End-to-End Multi-channel ASR with Bias Information for Meeting Transcription

36. Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR

37. Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition

38. On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer

39. Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings

40. Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability

41. Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers

42. L-Vector: Neural Label Embedding for Domain Adaptation

43. Active Voice Authentication

44. Serialized Output Training for End-to-End Overlapped Speech Recognition

45. High-Accuracy and Low-Latency Speech Recognition with Two-Head Contextual Layer Trajectory LSTM Model

46. Continuous speech separation: dataset and analysis

47. Domain Adaptation via Teacher-Student Learning for End-to-End Speech Recognition

48. Character-Aware Attention-Based End-to-End Speech Recognition

49. Speaker Adaptation for Attention-Based End-to-End Speech Recognition

50. Adversarial Speaker Adaptation
