Search

Your search keyword '"Ding Liang"' showing total 137 results

Search Constraints

Start Over You searched for: Author "Ding Liang" Remove constraint Author: "Ding Liang" Database arXiv Remove constraint Database: arXiv
137 results on '"Ding Liang"'

Search Results

1. Leveraging Metamemory Mechanisms for Enhanced Data-Free Code Generation in LLMs

2. Self-Evolution Knowledge Distillation for LLM-based Machine Translation

3. DynamicKV: Task-Aware Adaptive KV Cache Compression for Long Context LLMs

4. CogSteer: Cognition-Inspired Selective Layer Intervention for Efficient Semantic Steering in Large Language Models

5. Learning from Imperfect Data: Towards Efficient Knowledge Distillation of Autoregressive Language Models for Text-to-SQL

6. Simultaneous Computation and Memory Efficient Zeroth-Order Optimizer for Fine-Tuning Large Language Models

7. Self-Powered LLM Modality Expansion for Large Speech-Text Models

8. End-to-End Graph Flattening Method for Large Language Models

9. MQM-APE: Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators

10. Exploring and Enhancing the Transfer of Distribution in Knowledge Distillation for Autoregressive Language Models

11. $\mathbb{USCD}$: Improving Code Generation of LLMs by Uncertainty-Aware Selective Contrastive Decoding

12. Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models

13. Aligning Large Language Models from Self-Reference AI Feedback with one General Principle

14. Uncertainty Aware Learning for Language Model Alignment

15. Revisiting Catastrophic Forgetting in Large Language Model Tuning

16. Demystifying the Compression of Mixture-of-Experts Through a Unified Framework

17. Improving Complex Reasoning over Knowledge Graph with Logic-Aware Curriculum Tuning

18. 3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset

19. Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Better Solvers for Math Word Problems

20. Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding

21. Building Accurate Translation-Tailored LLMs with Language Aware Instruction Tuning

22. Kernel Multigrid: Accelerate Back-fitting via Sparse Gaussian Process Regression

23. Take Care of Your Prompt Bias! Investigating and Mitigating Prompt Bias in Factual Knowledge Extraction

24. Towards Training A Chinese Large Language Model for Anesthesiology

25. Towards Alleviating Text-to-Image Retrieval Hallucination for CLIP in Zero-shot Learning

26. Healthcare Copilot: Eliciting the Power of General LLMs for Medical Consultation

27. Revisiting Knowledge Distillation for Autoregressive Language Models

28. ROSE Doesn't Do That: Boosting the Safety of Instruction-Tuned Large Language Models with Reverse Prompt Contrastive Decoding

29. DB-LLM: Accurate Dual-Binarization for Efficient LLMs

30. InfoRM: Mitigating Reward Hacking in RLHF via Information-Theoretic Reward Modeling

31. A General Theory for Kernel Packets: from state space model to compactly supported basis

32. Revisiting Demonstration Selection Strategies in In-Context Learning

33. WisdoM: Improving Multimodal Sentiment Analysis by Fusing Contextual World Knowledge

34. OOP: Object-Oriented Programming Evaluation Benchmark for Large Language Models

35. Intention Analysis Makes LLMs A Good Jailbreak Defender

36. POMP: Probability-driven Meta-graph Prompter for LLMs in Low-resource Unsupervised Neural Machine Translation

37. Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion

38. Exploring Sparsity in Graph Transformers

39. Zero-Shot Sharpness-Aware Quantization for Pre-trained Language Models

40. Improved Convergence Rate of Nested Simulation with LSE on Sieve

41. Merging Experts into One: Improving Computational Efficiency of Mixture of Experts

42. Diversifying the Mixture-of-Experts Representation for Language Models with Orthogonal Optimizer

43. Unlikelihood Tuning on Negative Samples Amazingly Improves Zero-Shot Translation

44. Deep Model Fusion: A Survey

45. MerA: Merging Pretrained Adapters For Few-Shot Learning

46. Recursively Summarizing Enables Long-Term Dialogue Memory in Large Language Models

47. Can Linguistic Knowledge Improve Multimodal Alignment in Vision-Language Pretraining?

48. Efficient Federated Learning via Local Adaptive Amended Optimizer with Linear Speedup

49. Free-Form Composition Networks for Egocentric Action Recognition

50. Unsupervised Dense Retrieval with Relevance-Aware Contrastive Pre-Training

Catalog

Books, media, physical & digital resources