Search

Your search keyword '"Tramèr, Florian"' showing total 12 results

Search Constraints

Start Over You searched for: Author "Tramèr, Florian" Remove constraint Author: "Tramèr, Florian" Topic computer science - computation and language Remove constraint Topic: computer science - computation and language
12 results on '"Tramèr, Florian"'

Search Results

1. Blind Baselines Beat Membership Inference Attacks for Foundation Models

2. Competition Report: Finding Universal Jailbreak Backdoors in Aligned LLMs

3. Foundational Challenges in Assuring Alignment and Safety of Large Language Models

4. Query-Based Adversarial Prompt Generation

5. Universal Jailbreak Backdoors from Poisoned Human Feedback

6. Are aligned neural networks adversarially aligned?

7. Preventing Verbatim Memorization in Language Models Gives a False Sense of Privacy

8. Quantifying Memorization Across Neural Language Models

9. What Does it Mean for a Language Model to Preserve Privacy?

10. Counterfactual Memorization in Neural Language Models

11. Large Language Models Can Be Strong Differentially Private Learners

12. Extracting Training Data from Large Language Models

Catalog

Books, media, physical & digital resources