Search

Your search keyword '"Wang, Weiran"' showing total 523 results

Search Constraints

Start Over You searched for: Author "Wang, Weiran" Remove constraint Author: "Wang, Weiran"
523 results on '"Wang, Weiran"'

Search Results

1. Text Injection for Neural Contextual Biasing

2. Deferred NAM: Low-latency Top-K Context Injection via Deferred Context Encoding for Non-Streaming ASR

3. TransformerFAM: Feedback attention is working memory

4. Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models

5. USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models

6. Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm

7. Massive End-to-end Models for Short Search Queries

8. Augmenting conformers with structured state-space sequence models for online speech recognition

9. Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network

10. The Rise and Potential of Large Language Model Based Agents: A Survey

11. Text Injection for Capitalization and Turn-Taking Prediction in Speech Models

13. Practical Conformer: Optimizing size, speed and flops of Conformer for on-Device and cloud ASR

15. JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition

16. JOIST: A Joint Speech and Text Streaming Model For ASR

17. Improving Deliberation by Text-Only and Semi-Supervised Training

18. NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

19. Streaming Align-Refine for Non-autoregressive Deliberation

20. Improving Rare Word Recognition with LM-aware MWER Training

21. A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes

25. Deliberation of Streaming RNN-Transducer by Non-autoregressive Decoding

26. Contrastively Disentangled Sequential Variational Autoencoder

30. Understanding Latent Correlation-Based Multiview Learning and Self-Supervision: An Identifiability Perspective

36. Representation Learning for Sequence Data with Deep Autoencoding Predictive Components

37. An investigation of phone-based subword units for end-to-end speech recognition

38. A Comparison of Pooling Methods on LSTM Models for Rare Acoustic Event Classification

39. Unsupervised Pre-training of Bidirectional Speech Encoders via Masked Reconstruction

40. Data Techniques For Online End-to-end Speech Recognition

41. Semi-supervised ASR by End-to-end Self-training

43. Acoustic scene analysis with multi-head attention networks

47. Multimodal and Multi-view Models for Emotion Recognition

48. Everything old is new again: A multi-view learning approach to learning using privileged information and distillation

50. Reconstructing 3D Contour Models of General Scenes from RGB-D Sequences

Catalog

Books, media, physical & digital resources