Search

Your search keyword '"Khashabi, Daniel"' showing total 210 results

Search Constraints

Start Over You searched for: Author "Khashabi, Daniel" Remove constraint Author: "Khashabi, Daniel"
210 results on '"Khashabi, Daniel"'

Search Results

1. GenEx: Generating an Explorable World

2. Generative World Explorer

3. How Effective Is Self-Consistency for Long-Context Problems?

4. Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements

5. Upsample or Upweight? Balanced Training on Heavily Imbalanced Datasets

6. RATIONALYST: Pre-training Process-Supervision for Improving Reasoning

7. Benchmarking Language Model Creativity: A Case Study on Code Generation

8. WorldAPIs: The World Is Worth How Many APIs? A Thought Experiment

9. Core: Robust Factual Precision with Informative Sub-Claim Identification

10. Efficient Large Multi-modal Models via Visual Context Compression

11. Insights into LLM Long-Context Failures: When Transformers Know but Don't Tell

12. DiffNorm: Self-Supervised Normalization for Non-autoregressive Speech-to-speech Translation

13. SELF-[IN]CORRECT: LLMs Struggle with Discriminating Self-Generated Responses

14. Verifiable by Design: Aligning Language Models to Quote from Pre-Training Data

15. Dated Data: Tracing Knowledge Cutoffs in Large Language Models

16. Tur[k]ingBench: A Challenge Benchmark for Web Agents

17. RORA: Robust Free-Text Rationale Evaluation

18. AnaloBench: Benchmarking the Identification of Abstract and Long-context Analogies

19. k-SemStamp: A Clustering-Based Semantic Watermark for Detection of Machine-Generated Text

20. The Language Barrier: Dissecting Safety Challenges of LLMs in Multilingual Contexts

21. Do pretrained Transformers Learn In-Context by Gradient Descent?

22. SemStamp: A Semantic Watermark with Paraphrastic Robustness for Text Generation

23. Error Norm Truncation: Robust Training in the Presence of Data Noise for Text Generation Models

24. The Trickle-down Impact of Reward (In-)consistency on RLHF

25. GEAR: Augmenting Language Models with Generalizable and Efficient Tool Resolution

26. 'According to ...': Prompting Language Models Improves Quoting from Pre-Training Data

27. Flatness-Aware Prompt Selection Improves Accuracy and Sample Efficiency

28. Self-Instruct: Aligning Language Models with Self-Generated Instructions

29. When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories

30. Generating Sequences by Learning to Self-Correct

31. The Tail Wagging the Dog: Dataset Construction Biases of Social Bias Benchmarks

32. Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

33. ProsocialDialog: A Prosocial Backbone for Conversational Agents

34. Representation Projection Invariance Mitigates Representation Collapse

35. Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks

36. COLD Decoding: Energy-based Constrained Text Generation with Langevin Dynamics

37. UnifiedQA-v2: Stronger Generalization via Broader Cross-Format Training

38. NeuroLogic A*esque Decoding: Constrained Text Generation with Lookahead Heuristics

39. Prompt Waywardness: The Curious Case of Discretized Interpretation of Continuous Prompts

40. Time Waits for No One! Analysis and Challenges of Temporal Misalignment

41. Hey AI, Can You Solve Complex Tasks by Talking to Agents?

42. Reframing Instructional Prompts to GPTk's Language

43. Ethical-Advice Taker: Do Language Models Understand Natural Language Interventions?

44. Cross-Task Generalization via Natural Language Crowdsourcing Instructions

45. GooAQ: Open Question Answering with Diverse Answer Types

46. Think you have Solved Direct-Answer Question Answering? Try ARC-DA, the Direct-Answer AI2 Reasoning Challenge

47. GENIE: Toward Reproducible and Standardized Human Evaluation for Text Generation

48. Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies

49. ParsiNLU: A Suite of Language Understanding Challenges for Persian

50. UnQovering Stereotyping Biases via Underspecified Questions

Catalog

Books, media, physical & digital resources