Search

Your search keyword '"Kiela, Douwe"' showing total 299 results

Search Constraints

Start Over You searched for: Author "Kiela, Douwe" Remove constraint Author: "Kiela, Douwe"
299 results on '"Kiela, Douwe"'

Search Results

1. OLMoE: Open Mixture-of-Experts Language Models

2. Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment

3. Lynx: An Open Source Hallucination Evaluation Model

4. Generative Representational Instruction Tuning

5. KTO: Model Alignment as Prospect Theoretic Optimization

6. I am a Strange Dataset: Metalinguistic Tests for Language Models

7. Leveraging Diffusion Perturbations for Measuring Fairness in Computer Vision

8. FinanceBench: A New Benchmark for Financial Question Answering

9. Anchor Points: Benchmarking Models with Much Fewer Examples

10. Towards Language Models That Can See: Computer Vision Through the LENS of Natural Language

11. OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents

12. AfroDigits: A Community-Driven Spoken Digit Dataset for African Languages

13. Investigating Multi-source Active Learning for Natural Language Inference

14. Measuring Data

15. BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

16. Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements

17. DataPerf: Benchmarks for Data-Centric AI Development

18. Perturbation Augmentation for Fairer NLP

19. Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality

20. Dynatask: A Framework for Creating Dynamic AI Benchmark Tasks

21. Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants

22. FLAVA: A Foundational Language And Vision Alignment Model

23. Analyzing Dynamic Adversarial Training Data in the Limit

24. What's Hidden in a One-layer Randomly Weighted Transformer?

25. Human-Adversarial Visual Question Answering

26. On the Efficacy of Adversarial Data Collection for Question Answering: Results from a Large-Scale Randomized Study

27. True Few-Shot Learning with Language Models

28. Dynaboard: An Evaluation-As-A-Service Platform for Holistic Next-Generation Benchmarking

29. Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation

30. Cross-Modal Retrieval Augmentation for Multi-Modal Classification

31. Retrieval Augmentation Reduces Hallucination in Conversation

32. Gradient-based Adversarial Attacks against Text Transformers

33. Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little

34. Dynabench: Rethinking Benchmarking in NLP

35. Quasi-Equivalence Discovery for Zero-Shot Emergent Communication

36. Rissanen Data Analysis: Examining Dataset Characteristics via Description Length

37. Learning from the Worst: Dynamically Generated Datasets to Improve Online Hate Detection

38. DynaSent: A Dynamic Benchmark for Sentiment Analysis

39. Reservoir Transformers

40. I like fish, especially dolphins: Addressing Contradictions in Dialogue Modeling

41. To what extent do human explanations of model behavior align with actual model behavior?

42. Exploring Zero-Shot Emergent Communication in Embodied Multi-Agent Populations

43. ANLIzing the Adversarial Natural Language Inference Dataset

44. Answering Complex Open-Domain Questions with Multi-Hop Dense Retrieval

45. Learning Optimal Representations with the Decodable Information Bottleneck

46. Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

47. The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes

48. Multi-Dimensional Gender Bias Classification

49. Unsupervised Question Decomposition for Question Answering

50. I love your chain mail! Making knights smile in a fantasy game world: Open-domain goal-oriented dialogue agents

Catalog

Books, media, physical & digital resources