Search

Your search keyword '"Ré, Christopher"' showing total 184 results

Search Constraints

Start Over You searched for: Author "Ré, Christopher" Remove constraint Author: "Ré, Christopher" Database arXiv Remove constraint Database: arXiv
184 results on '"Ré, Christopher"'

Search Results

1. ThunderKittens: Simple, Fast, and Adorable AI Kernels

2. LoLCATs: On Low-Rank Linearizing of Large Language Models

3. Automated Rewards via LLM-Generated Progress Functions

4. Restructuring Vector Quantization with the Rotation Trick

5. Cookbook: A framework for improving LLM generative abilities via programmatic data generating templates

6. Archon: An Architecture Search Framework for Inference-Time Techniques

7. Large Language Monkeys: Scaling Inference Compute with Repeated Sampling

8. Just read twice: closing the recall gap for recurrent language models

9. WONDERBREAD: A Benchmark for Evaluating Multimodal Foundation Models on Business Process Management Tasks

10. State-Free Inference of State-Space Models: The Transfer Function Approach

11. Automating the Enterprise with Foundation Models

12. Mechanistic Design and Scaling of Hybrid Architectures

13. Simple linear attention language models balance the recall-throughput tradeoff

14. Prospector Heads: Generalized Feature Attribution for Large Models & Data

15. Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT

16. Hydragen: High-Throughput LLM Inference with Shared Prefixes

17. The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry

18. Zoology: Measuring and Improving Recall in Efficient Language Models

19. FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores

20. Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions

21. Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time

22. Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture

23. Context-Aware Meta-Learning

24. LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models

25. Skill-it! A Data-Driven Skills Framework for Understanding and Training Language Models

26. Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification

27. Fast Algorithms for a New Relaxation of Optimal Transport

28. H$_2$O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models

29. Towards trustworthy seizure onset detection using workflow notes

30. TART: A plug-and-play Transformer module for task-agnostic reasoning

31. Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes

32. Effectively Modeling Time Series with Simple Discrete State Spaces

33. FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU

34. Collage Diffusion

35. Hyena Hierarchy: Towards Larger Convolutional Language Models

36. Simple Hardware-Efficient Long Convolutions for Sequence Modeling

37. Hungry Hungry Hippos: Towards Language Modeling with State Space Models

38. Transform Once: Efficient Operator Learning in Frequency Domain

39. Holistic Evaluation of Language Models

40. S4ND: Modeling Images and Videos as Multidimensional Signals Using State Spaces

41. Ask Me Anything: A simple strategy for prompting language models

42. HAPI: A Large-scale Longitudinal Dataset of Commercial ML API Predictions

43. LegalBench: Prototyping a Collaborative Benchmark for Legal Reasoning

44. Contrastive Adapters for Foundation Model Group Robustness

45. How to Train Your HiPPO: State Space Models with Generalized Orthogonal Basis Projections

46. On the Parameterization and Initialization of Diagonal State Space Models

47. Self-Supervised Learning of Brain Dynamics from Broad Neuroimaging Data

48. The Importance of Background Information for Out of Distribution Generalization

49. Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees

50. Decentralized Training of Foundation Models in Heterogeneous Environments

Catalog

Books, media, physical & digital resources