Search

Your search keyword '"Ling, Zhen"' showing total 1,547 results

Search Constraints

Start Over You searched for: Author "Ling, Zhen" Remove constraint Author: "Ling, Zhen"
1,547 results on '"Ling, Zhen"'

Search Results

1. SVDq: 1.25-bit and 410x Key Cache Compression for LLM Attention

2. Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models

3. TORCHLIGHT: Shedding LIGHT on Real-World Attacks on Cloudless IoT Devices Concealed within the Tor Network

4. RPO: Retrieval Preference Optimization for Robust Retrieval-Augmented Generation

5. Unispeaker: A Unified Approach for Multimodality-driven Speaker Generation

6. Revolutionizing Encrypted Traffic Classification with MH-Net: A Multi-View Heterogeneous Graph Model

7. Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis

8. On the Generation and Removal of Speaker Adversarial Perturbation for Voice-Privacy Protection

9. Leveraging Prompt Learning and Pause Encoding for Alzheimer's Disease Detection

10. A Neural Denoising Vocoder for Clean Waveform Generation from Noisy Mel-Spectrogram based on Amplitude and Phase Predictions

11. ESTVocoder: An Excitation-Spectral-Transformed Neural Vocoder Conditioned on Mel Spectrogram

12. SAMOS: A Neural MOS Prediction Model Leveraging Semantic Representations and Acoustic Features

13. Pitch-and-Spectrum-Aware Singing Quality Assessment with Bias Correction and Model Fusion

14. MDCTCodec: A Lightweight MDCT-based Neural Audio Codec towards High Sampling Rate and Low Bitrate Scenarios

15. APCodec+: A Spectrum-Coding-Based High-Fidelity and High-Compression-Rate Neural Audio Codec with Staged Training Paradigm

16. Meta-DiffuB: A Contextualized Sequence-to-Sequence Text Diffusion Model with Meta-Exploration

17. ERVQ: Enhanced Residual Vector Quantization with Intra-and-Inter-Codebook Optimization for Neural Audio Codecs

18. Retrieving, Rethinking and Revising: The Chain-of-Verification Can Improve Retrieval Augmented Generation

19. Stage-Wise and Prior-Aware Neural Speech Phase Prediction

23. Transformer-Based Model for Auditory EEG Decoding

26. Asynchronous Voice Anonymization Using Adversarial Perturbation On Speaker Embedding

27. Clever Hans Effect Found in Automatic Detection of Alzheimer's Disease through Speech

28. Multi-Stage Speech Bandwidth Extension with Flexible Sampling Rate Control

29. BiVocoder: A Bidirectional Neural Vocoder Integrating Feature Extraction and Waveform Generation

30. Perturbation-Restrained Sequential Model Editing

31. Voice Attribute Editing with Text Prompt

32. Low-Latency Neural Speech Phase Prediction based on Parallel Estimation Architecture and Anti-Wrapping Losses for Speech Generation Tasks

33. Multiscale Matching Driven by Cross-Modal Similarity Consistency for Audio-Text Retrieval

34. APCodec: A Neural Audio Codec with Parallel Amplitude and Phase Spectrum Encoding and Decoding

35. One Train for Two Tasks: An Encrypted Traffic Classification Framework Using Supervised Contrastive Learning

36. Neighboring Perturbations of Knowledge Editing on Large Language Models

37. Corrective Retrieval Augmented Generation

38. Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction

39. Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue

40. MoVQA: A Benchmark of Versatile Question-Answering for Long-Form Movie Understanding

41. Sparsity-Driven EEG Channel Selection for Brain-Assisted Speech Enhancement

42. APNet2: High-quality and High-efficiency Neural Vocoder with Direct Prediction of Amplitude and Phase Spectra

43. Is ChatGPT a Good Multi-Party Conversation Solver?

44. Untying the Reversal Curse via Bidirectional Language Model Editing

45. Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement

46. Face-Driven Zero-Shot Voice Conversion with Memory-based Face-Voice Alignment

47. Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

48. Long-frame-shift Neural Speech Phase Prediction with Spectral Continuity Enhancement and Interpolation Error Compensation

49. Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement through Knowledge Distillation

50. MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra

Catalog

Books, media, physical & digital resources