Search

Your search keyword '"Guo, Pengcheng"' showing total 758 results

Search Constraints

Start Over You searched for: Author "Guo, Pengcheng" Remove constraint Author: "Guo, Pengcheng"
758 results on '"Guo, Pengcheng"'

Search Results

1. SQ-Whisper: Speaker-Querying based Whisper Model for Target-Speaker ASR

2. Optimizing Dysarthria Wake-Up Word Spotting: An End-to-End Approach for SLT 2024 LRDWWS Challenge

3. NPU-NTU System for Voice Privacy 2024 Challenge

4. Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models

5. Towards Rehearsal-Free Multilingual ASR: A LoRA-based Case Study on Whisper

6. Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models

7. CRMSP: A Semi-supervised Approach for Key Information Extraction with Class-Rebalancing and Merged Semantic Pseudo-Labeling

8. MUSA: Multi-lingual Speaker Anonymization via Serial Disentanglement

9. Distinctive and Natural Speaker Anonymization via Singular Value Transformation-assisted Matrix

10. Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets

11. Enhancing Lip Reading with Multi-Scale Video and Multi-Encoder

12. An audio-quality-based multi-strategy approach for target speaker extraction in the MISP 2023 Challenge

13. The NPU-ASLP-LiAuto System Description for Visual Speech Recognition in CNVSRC 2023

14. ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge

15. MLCA-AVSR: Multi-Layer Cross Attention Fusion based Audio-Visual Speech Recognition

16. Automatic channel selection and spatial feature integration for multi-channel speech recognition across various array topologies

17. Decoupling and Interacting Multi-Task Learning Network for Joint Speech and Accent Recognition

18. SA-Paraformer: Non-autoregressive End-to-End Speaker-Attributed ASR

22. Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study

23. Timbre-reserved Adversarial Attack in Speaker Identification

25. Multimodal cell atlas of the ageing human skeletal muscle

26. A spatiotemporal atlas of cholestatic injury and repair in mice

27. A spatiotemporal atlas of mouse liver homeostasis and regeneration

28. TVDO: Tchebycheff Value-Decomposition Optimization for Multi-Agent Reinforcement Learning

29. Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition

30. Pseudo-Siamese Network based Timbre-reserved Black-box Adversarial Attack in Speaker Identification

31. BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR

32. TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition

33. Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network

34. The NPU-ASLP System for Audio-Visual Speech Recognition in MISP 2022 Challenge

35. VE-KWS: Visual Modality Enhanced End-to-End Keyword Spotting

36. The Design of an Efficient Distributed Collaborative Scheduling Method and the Optimal Planning Strategy for Providing Photovoltaic Access Capacity

37. Controllability of Windmill Networks

38. TESSP: Text-Enhanced Self-Supervised Speech Pre-training

39. Distinguishable Speaker Anonymization based on Formant and Fundamental Frequency Scaling

40. Preserving background sound in noise-robust voice conversion via multi-task learning

41. MFCCA:Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario

44. NWPU-ASLP System for the VoicePrivacy 2022 Challenge

45. Improving Transformer-based Conversational ASR by Inter-Sentential Attention Mechanism

46. Linguistic-Acoustic Similarity Based Accent Shift for Accent Recognition

Catalog

Books, media, physical & digital resources