Author search for "Weston, Jason": 484 results in total

Search Results

1. Better Alignment with Instruction Back-and-Forth Translation

2. Self-Taught Evaluators

3. Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge

4. Distilling System 2 into System 1

5. Following Length Constraints in Instructions

6. Contextual Position Encoding: Learning to Count What's Important

7. Iterative Reasoning Preference Optimization

8. Reverse Training to Nurse the Reversal Curse

9. Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM

10. TOOLVERIFIER: Generalization to New Tools via Self-Verification

11. Self-Rewarding Language Models

12. Some things are more CRINGE than others: Iterative Preference Optimization with the Pairwise Cringe Loss

13. System 2 Attention (is something you might need too)

14. The ART of LLM Refinement: Ask, Refine, and Trust

15. Branch-Solve-Merge Improves Large Language Model Evaluation and Generation

16. Walking Down the Memory Maze: Beyond Context Limit through Interactive Reading

17. Chain-of-Verification Reduces Hallucination in Large Language Models

18. Self-Alignment with Instruction Backtranslation

19. Leveraging Implicit Feedback from Deployment Data in Dialogue

20. System-Level Natural Language Feedback

21. The HCI Aspects of Public Deployment of Research Chatbots: A User Study, Design Recommendations, and Open Challenges

22. Improving Open Language Models by Learning from Organic Interactions

23. Large Language Model Programs

24. Learning to Reason and Memorize with Self-Notes

25. Multi-Party Chat: Conversational Agents in Group Settings with Humans and Models

26. The Stable Entropy Hypothesis and Entropy-Aware Decoding: An Analysis and Algorithm for Robust Natural Language Generation

27. Infusing Commonsense World Models with Graph Knowledge

28. The CRINGE Loss: Learning what language not to model

29. When Life Gives You Lemons, Make Cherryade: Converting Feedback from Bad Responses into Good Labels

30. Learning New Skills after Deployment: Improving open-domain internet-driven dialogue with human feedback

31. BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage

32. Learning from data in the mixed adversarial non-adversarial case: Finding the helpers and ignoring the trolls

33. DIRECTOR: Generator-Classifiers For Supervised Language Modeling

34. Language Models that Seek for Knowledge: Modular Search & Generation for Dialogue and Prompt Completion

35. Human Evaluation of Conversations is an Open Problem: comparing the sensitivity of various methods for evaluating dialogue agents

36. Am I Me or You? State-of-the-Art Dialogue Models Cannot Maintain an Identity

37. Reason first, then respond: Modular Generation for Knowledge-infused Dialogue

38. NormFormer: Improved Transformer Pretraining with Extra Normalization

39. Beyond Goldfish Memory: Long-Term Open-Domain Conversation

40. Internet-Augmented Dialogue Generation

41. Staircase Attention for Recurrent Processing of Sequences

42. Hash Layers For Large Sparse Models

43. Not All Memories are Created Equal: Learning to Forget by Expiring

44. Retrieval Augmentation Reduces Hallucination in Conversation

45. I like fish, especially dolphins: Addressing Contradictions in Dialogue Modeling

46. Recipes for Safety in Open-domain Chatbots

47. Multi-Modal Open-Domain Dialogue

48. How to Motivate Your Dragon: Teaching Goal-Driven Agents to Speak and Act in Fantasy Worlds

49. Deploying Lifelong Open-Domain Dialogue Learning

50. Open-Domain Conversational Agents: Current Progress, Open Problems, and Future Directions
