Search

Your search for author "Sham P." returned 3,840 results

Search Results

1. Train for the Worst, Plan for the Best: Understanding Token Ordering in Masked Diffusions

2. Connections between Schedule-Free Optimizers, AdEMAMix, and Accelerated SGD Variants

3. Soup to go: mitigating forgetting during continual learning with model averaging

4. From an Image to a Scene: Learning to Imagine the World from a Million 360 Videos

5. Mind the Gap: Examining the Self-Improvement Capabilities of Large Language Models

6. Loss-to-Loss Prediction: Scaling Laws for All Datasets

7. How Does Critical Batch Size Scale in Pre-training?

8. Mixture of Parrots: Experts improve memorization more than reasoning

9. LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks

10. Flash Inference: Near Linear Time Inference for Long Convolution Sequence Models and Beyond

11. Neural Coordination and Capacity Control for Inventory Management

12. SOAP: Improving and Stabilizing Shampoo using Adam

13. Preference-Based Multi-Agent Reinforcement Learning: Data Coverage and Algorithmic Techniques

14. A Systematic Literature Review of Informal STEM Learning

15. Universal and scalable synthesis of photochromic single-atom catalysts for plastic recycling.

16. Functional multiomics reveals genetic and pharmacologic regulation of surface CD38 in multiple myeloma

17. An Edge AI System Based on FPGA Platform for Railway Fault Detection

18. Deconstructing What Makes a Good Optimizer for Language Models

19. Universal Length Generalization with Turing Programs

20. Eliminating Position Bias of Language Models: A Mechanistic Approach

21. A New Perspective on Shampoo's Preconditioner

22. DataComp-LM: In search of the next generation of training sets for language models

23. Transcendence: Generative Models Can Outperform The Experts That Train Them

24. CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training

25. Scaling Laws in Linear Regression: Compute, Parameters, and Data

26. Phase 1 clinical trial of B-Cell Maturation Antigen (BCMA) NEX-T® Chimeric Antigen Receptor (CAR) T cell therapy CC-98633/BMS-986354 in participants with triple-class exposed multiple myeloma

27. Starfysh integrates spatial transcriptomic and histologic data to reveal heterogeneous tumor–immune hubs

35. Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass

36. Matching the Statistical Query Lower Bound for $k$-Sparse Parity Problems with Sign Stochastic Gradient Descent

37. Association of neurotransmitter pathway polygenic risk with specific symptom profiles in psychosis

48. Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation Systems

49. Q-Probe: A Lightweight Approach to Reward Maximization for Language Models

50. Memristor-Based MobileNetV3 Circuit Design for Image Classification
