Search

Your search for "Zheng, Lianmin" returned 123 results

Search Constraints

Author: "Zheng, Lianmin"; Publication Year Range: Last 10 years
Search Results

1. Experiment Research on Feasibility of In-Situ Plasma Cleaning in Normal-conducting Copper Cavities

2. Post-Training Sparse Attention with Double Sparsity

3. Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference

4. SGLang: Efficient Execution of Structured Language Model Programs

5. Rethinking Benchmark and Contamination for Language Models with Rephrased Samples

6. S-LoRA: Serving Thousands of Concurrent LoRA Adapters

7. Mapping electrostatic potential in electrolyte solution

8. LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset

9. Efficient Memory Management for Large Language Model Serving with PagedAttention

10. H$_2$O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models

11. Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena

12. On Optimal Caching and Model Multiplexing for Large Model Inference

13. FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU

14. AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving

15. On Optimizing the Communication of Model Parallelism

16. TensorIR: An Abstraction for Automatic Tensorized Program Optimization

17. NumS: Scalable Array Programming for the Cloud

18. GACT: Activation Compressed Training for Generic Network Architectures

19. Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning

20. Eliminating uncertainty of thermal emittance measurement in solenoid scans due to rf and solenoid fields overlap

21. ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training

23. Ansor: Generating High-Performance Tensor Programs for Deep Learning

24. Scalable and Efficient Systems for Large Deep Learning Models

25. A Unified Optimization Approach for CNN Model Inference on Integrated GPUs

26. Development and high-power testing of an X-band dielectric-loaded power extractor

27. Rapid thermal emittance and quantum efficiency mapping of a cesium telluride cathode in an rf photoinjector using multiple laser beamlets

28. Experimental demonstration of the correction of coupled transverse dynamics aberration in an rf photoinjector

30. Overestimation of thermal emittance in solenoid scans due to coupled transverse motion

31. A Hardware-Software Blueprint for Flexible Deep Learning Specialization

32. Learning to Optimize Tensor Programs

33. TVM: An Automated End-to-End Optimizing Compiler for Deep Learning

34. Size-to-depth: A New Perspective for Single Image Depth Estimation

35. MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective Intelligence

38. Effects of Laser Pulse Heating of Copper Photocathodes on High-brightness Electron Beam Production at Blowout Regime

39. Development of an L-band continuous-wave buncher at Tsinghua University

40. Design, fabrication, and beam commissioning of a 216.667 MHz continuous-wave photocathode very-high-frequency electron gun

43. High-throughput Generative Inference of Large Language Models with a Single GPU

49. Rapid thermal emittance and quantum efficiency mapping of a cesium telluride cathode in an rf photoinjector using multiple laser beamlets
