Search

Your search keyword '"Cheng, Ning"' showing total 2,064 results

Search Constraints

Start Over You searched for: Author "Cheng, Ning" Remove constraint Author: "Cheng, Ning"
2,064 results on '"Cheng, Ning"'

Search Results

1. PFID: Privacy First Inference Delegation Framework for LLMs

2. EffectiveASR: A Single-Step Non-Autoregressive Mandarin Speech Recognition Architecture with High Accuracy and Inference Speed

3. Touch100k: A Large-Scale Touch-Language-Vision Dataset for Touch-Centric Multimodal Representation

4. Enhancing Emotion Recognition in Conversation through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning

5. RSET: Remapping-based Sorting Method for Emotion Transfer Speech Synthesis

6. RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval

7. Transformer in Touch: A Survey

8. Potential and Limitations of LLMs in Capturing Structured Semantics: A Case Study on SRL

9. MAIN-VC: Lightweight Speech Representation Disentanglement for One-shot Voice Conversion

10. Learning Expressive Disentangled Speech Representations with Soft Speech Units and Adversarial Style Augmentation

11. QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question Answering

12. EAD-VC: Enhancing Speech Auto-Disentanglement for Voice Conversion with IFUB Estimator and Joint Text-Guided Consistent Learning

13. EfficientASR: Speech Recognition Network Compression via Attention Redundancy and Chunk-Level FFN Optimization

14. CONTUNER: Singing Voice Beautifying with Pitch and Expressiveness Condition

15. Towards Comprehensive Multimodal Perception: Introducing the Touch-Language-Vision Dataset

16. Medical Speech Symptoms Classification via Disentangled Representation

17. Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning

18. DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation

19. CP-EB: Talking Face Generation with Controllable Pose and Eye Blinking Embedding

20. CLN-VC: Text-Free Voice Conversion Based on Fine-Grained Style Control and Contrastive Learning with Negative Samples Augmentation

21. PRCA: Fitting Black-Box Large Language Models for Retrieval Question Answering via Pluggable Reward-Driven Contextual Adapter

22. An In-depth Survey of Large Language Model-based Artificial Intelligence Agents

23. Contrastive Latent Space Reconstruction Learning for Audio-Text Retrieval

24. FastGraphTTS: An Ultrafast Syntax-Aware Speech Synthesis Framework

25. AOSR-Net: All-in-One Sandstorm Removal Network

26. DiffTalker: Co-driven audio-image diffusion for talking faces via intermediate landmarks

27. Voice Conversion with Denoising Diffusion Probabilistic GAN Models

28. Machine Unlearning Methodology base on Stochastic Teacher Network

29. Symbolic & Acoustic: Multi-domain Music Emotion Modeling for Instrumental Music

30. From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning

31. PMVC: Data Augmentation-Based Prosody Modeling for Expressive Voice Conversion

32. Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism

33. Prompt Guided Copy Mechanism for Conversational Question Answering

34. CollabKG: A Learnable Human-Machine-Cooperative Information Extraction Toolkit for (Event) Knowledge Graph Construction

35. The mechanisms of Porphyromonas gingivalis–derived outer membrane vesicles-induced neurotoxicity and microglia activation

36. EmoMix: Emotion Mixing via Diffusion Models for Emotional Speech Synthesis

37. SAR: Self-Supervised Anti-Distortion Representation for End-To-End Speech Model

39. On the Calibration and Uncertainty with P\'{o}lya-Gamma Augmentation for Dialog Retrieval Models

40. Efficient Uncertainty Estimation with Gaussian Process for Reliable Dialog Response Retrieval

41. Improving EEG-based Emotion Recognition by Fusing Time-frequency And Spatial Representations

42. Dynamic Alignment Mask CTC: Improved Mask-CTC with Aligned Cross Entropy

43. QI-TTS: Questioning Intonation Control for Emotional Speech Synthesis

44. Improving Music Genre Classification from Multi-Modal Properties of Music and Genre Correlations Perspective

45. ChatIE: Zero-Shot Information Extraction via Chatting with ChatGPT

46. Prevalence and Epidemiological Characteristics of Venous Thromboembolism in Jiaxing City

47. Linguistic-Enhanced Transformer with CTC Embedding for Speech Recognition

48. Improving Imbalanced Text Classification with Dynamic Curriculum Learning

49. MetaSpeech: Speech Effects Switch Along with Environment for Metaverse

50. Semi-Supervised Learning Based on Reference Model for Low-resource TTS

Catalog

Books, media, physical & digital resources