Search

Your search for "Yang, Wenkai" returned 11 results

Search Constraints

Author: "Yang, Wenkai"
Topic: computer science - computation and language

Search Results

1. Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization

2. Exploring Backdoor Vulnerabilities of Chat Models

3. Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents

4. Enabling Large Language Models to Learn from Rules

5. Towards Codable Watermarking for Injecting Multi-bits Information to LLMs

6. Communication Efficient Federated Learning for Multilingual Neural Machine Translation with Adapter

7. Fine-Tuning Deteriorates General Textual Out-of-Distribution Detection by Distorting Task-Agnostic Features

8. Expose Backdoors on the Way: A Feature-Based Efficient Defense against Textual Backdoor Attacks

9. RAP: Robustness-Aware Perturbations for Defending against Backdoor Attacks on NLP Models

10. Well-classified Examples are Underestimated in Classification with Deep Neural Networks

11. Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models
