46 results on '"Yazdanbakhsh, Amir"'
Search Results
2. USM-Lite: Quantization and Sparsity Aware Fine-Tuning for Speech Recognition with Universal Speech Models
3. Exploiting Intel® Advanced Matrix Extensions (AMX) for Large Language Model Inference
4. Architecture 2.0: Challenges and Opportunities
5. ArchGym: An Open-Source Gymnasium for Machine Learning Assisted Architecture Design
6. MESA: Microarchitecture Extensions for Spatial Architecture Generation
7. Towards Breaking the Memory Bandwidth Wall Using Approximate Value Prediction
8. FLAT: An Optimized Dataflow for Mitigating Attention Bottlenecks
9. What Makes Chain-of-Thought Prompting Effective? A Counterfactual Study
10. An Evaluation of Edge TPU Accelerators for Convolutional Neural Networks
11. GRANITE: A Graph Neural Network Model for Basic Block Throughput Estimation
12. Sparse Attention Acceleration with Synergistic In-Memory Pruning and On-Chip Recomputation
13. Accelerating attention through gradient-based learned runtime pruning
14. IO-Aware Custom Instruction Exploration for Customizing Embedded Processors
15. Mixed-Signal Charge-Domain Acceleration of Deep Neural Networks through Interleaved Bit-Partitioned Arithmetic
16. ReLeQ : A Reinforcement Learning Approach for Automatic Deep Quantization of Neural Networks
17. AxMemo
18. In-DRAM near-data approximate acceleration for GPUs
19. SiMul: An Algorithm-Driven Approximate Multiplier Design for Machine Learning
20. SnaPEA: Predictive Early Activation for Reducing Computation in Deep Convolutional Neural Networks
21. GANAX: A Unified MIMD-SIMD Acceleration for Generative Adversarial Networks
22. FlexiGAN: An End-to-End Solution for FPGA Acceleration of Generative Adversarial Networks
23. AxBench: A Multiplatform Benchmark Suite for Approximate Computing
24. Towards statistical guarantees in controlling quality tradeoffs for approximate acceleration
25. TABLA: A unified template-based framework for accelerating statistical machine learning
26. Mitigating the Memory Bottleneck With Approximate Load Value Prediction
27. RFVP
28. GRATER: An Approximation Workflow for Exploiting Data-Level Parallelism in FPGA Acceleration
29. Neural acceleration for GPU throughput processors
30. Comprehensive Circuit Failure Prediction for Logic and SRAM Using Virtual Aging
31. Axilog: Abstractions for Approximate Hardware Design and Reuse
32. Online and Operand-Aware Detection of Failures Utilizing False Alarm Vectors
33. Axilog: Abstractions for Approximate Hardware Design and Reuse
34. Axilog: Language Support for Approximate Hardware Design
35. Implementation-aware selection of the custom instruction set for extensible processors
36. Rollback-free value prediction with approximate loads
37. General-purpose code acceleration with limited-precision analog computation
38. General-purpose code acceleration with limited-precision analog computation
39. Customized pipeline and instruction set architecture for embedded processing engines
40. A new merit function for custom instruction selection under an area budget constraint
41. Instruction set architectural guidelines for embedded packet-processing engines
42. Locality considerations in exploring custom instruction selection algorithms
43. Energy-aware design space exploration of registerfile for extensible processors
44. Instruction reliability analysis for embedded processors
45. Reliability Analysis of Embedded Applications in Non-Uniform Fault Tolerant Processors
46. Architecture-Aware Graph-Covering Algorithm for Custom Instruction Selection
Catalog
Books, media, physical & digital resources
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.