Search

Search keyword "Jia, Zhihao": 31 results in total

Search Constraints

You searched for: Author "Jia, Zhihao"; Publication Type: Electronic Resources

Search Results

1. Parcae: Proactive, Liveput-Optimized DNN Training on Preemptible Instances

2. FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning

3. Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding

4. Accelerating Retrieval-Augmented Language Model Serving with Speculation

5. Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language Models

6. SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices

7. Helix: Distributed Serving of Large Language Models via Max-Flow on Heterogeneous GPUs

8. A Multi-Level Superoptimizer for Tensor Programs

9. SpecInfer: Accelerating Generative Large Language Model Serving with Tree-based Speculative Inference and Verification

10. Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems

11. SpotServe: Serving Generative Large Language Models on Preemptible Instances

12. Drone-NeRF: Efficient NeRF Based 3D Scene Reconstruction for Large-Scale Drone Survey

13. Quarl: A Learning-Based Quantum Circuit Optimizer

14. Quark: A Gradient-Free Quantum Learning Framework for Classification Tasks

15. OLLIE: Derivation-based Tensor Program Optimizer

16. Optimizing Mixture of Experts using Dynamic Recompilations

17. Bamboo: Making Preemptible Instances Resilient for Affordable Training of Large DNNs

18. Quartz: Superoptimization of Quantum Circuits (Extended Version)

19. TopoOpt: Co-optimizing Network Topology and Parallelization Strategy for Distributed Training Jobs

20. BOND: Benchmarking Unsupervised Outlier Node Detection on Static Attributed Graphs

21. Quanto: Optimizing Quantum Circuits with Automatic Generation of Circuit Identities

22. Collage: Seamless Integration of Deep Learning Backends with Automatic Placement

23. TOD: GPU-accelerated Outlier Detection via Tensor Operations

24. GradSign: Model Performance Inference with Theoretical Insights

25. Dorylus: Affordable, Scalable, and Accurate GNN Training with Distributed CPU Servers and Serverless Threads

26. Software-Hardware Co-design for Fast and Scalable Training of Deep Learning Recommendation Models

27. IOS: Inter-Operator Scheduler for CNN Acceleration

28. Redundancy-Free Computation Graphs for Graph Neural Networks

29. Beyond Data and Model Parallelism for Deep Neural Networks

30. Exploring Hidden Dimensions in Parallelizing Convolutional Neural Networks

31. Undefined behavior: what happened to my code?
