Search

Your search keyword '"Yazdanbakhsh, Amir"' showing total 118 results

Search Constraints

Start Over You searched for: Author "Yazdanbakhsh, Amir" Remove constraint Author: "Yazdanbakhsh, Amir"
118 results on '"Yazdanbakhsh, Amir"'

Search Results

1. When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models

2. ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization

3. Effective Interplay between Sparsity and Quantization: From Theory to Practice

4. SLoPe: Double-Pruned Sparse Plus Lazy Low-Rank Adapter Pretraining of LLMs

5. Tao: Re-Thinking DL-based Microarchitecture Simulation

6. DaCapo: Accelerating Continuous Learning in Autonomous Systems for Video Analytics

7. Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers

8. USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models

9. JaxPruner: A concise library for sparsity research

10. Self-Refine: Iterative Refinement with Self-Feedback

11. In-Storage Domain-Specific Acceleration for Serverless Computing

12. Learning Performance-Improving Code Edits

13. STEP: Learning N:M Structured Sparsity Masks from Scratch with Precondition

14. GRANITE: A Graph Neural Network Model for Basic Block Throughput Estimation

15. Text and Patterns: For Effective Chain of Thought, It Takes Two to Tango

16. Training Recipe for N:M Structured Sparsity with Decaying Pruning Mask

17. Sparse Attention Acceleration with Synergistic In-Memory Pruning and On-Chip Recomputation

18. Accelerating Attention through Gradient-Based Learned Runtime Pruning

19. Data-Driven Offline Optimization For Architecting Hardware Accelerators

20. FLAT: An Optimized Dataflow for Mitigating Attention Bottlenecks

21. An Evaluation of Edge TPU Accelerators for Convolutional Neural Networks

22. Rethinking Co-design of Neural Architectures and Hardware Accelerators

23. Apollo: Transferable Architecture Exploration

24. Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation

25. Mixed-Signal Charge-Domain Acceleration of Deep Neural networks through Interleaved Bit-Partitioned Arithmetic

26. ReLeQ: A Reinforcement Learning Approach for Deep Quantization of Neural Networks

28. USM-Lite: Quantization and Sparsity Aware Fine-Tuning for Speech Recognition with Universal Speech Models

29. GANAX: A Unified MIMD-SIMD Acceleration for Generative Adversarial Networks

31. An assessment of the role of nanosilica in thermal/thermooxidative degradation mechanism of poly(lactic acid)/polybutylene adipate terephthalate blend nanocomposites.

39. Domain-Specific Computational Storage for Serverless Computing

Catalog

Books, media, physical & digital resources