Search

Your search keyword '"Verhelst, Marian"' showing total 37 results

Search Constraints

Start Over You searched for: Author "Verhelst, Marian" Remove constraint Author: "Verhelst, Marian" Database arXiv Remove constraint Database: arXiv
37 results on '"Verhelst, Marian"'

Search Results

1. OpenGeMM: A High-Utilization GeMM Accelerator Generator with Lightweight RISC-V Control and Tight Memory Coupling

2. MATCH: Model-Aware TVM-based Compilation for Heterogeneous Edge Devices

3. Pack my weights and run! Minimizing overheads for in-memory computing accelerators

4. COAC: Cross-layer Optimization of Accelerator Configurability for Efficient CNN Processing

5. Optimising GPGPU Execution Through Runtime Micro-Architecture Parameter Analysis

6. CMDS: Cross-layer Dataflow Optimization for DNN Accelerators Exploiting Multi-bank Memories

7. Optimizing Layer-Fused Scheduling of Transformer Networks on Multi-accelerator Platforms

8. HTVM: Efficient Neural Network Deployment On Heterogeneous TinyML Platforms

9. ACCO: Automated Causal CNN Scheduling Optimizer for Real-Time Edge Accelerators

10. Analog or Digital In-memory Computing? Benchmarking through Quantitative Modeling

11. PATRONoC: Parallel AXI Transport Reducing Overhead for Networks-on-Chip targeting Multi-Accelerator DNN Platforms at the Edge

12. Precision-aware Latency and Energy Balancing on Multi-Accelerator Platforms for DNN Inference

13. Benchmarking and modeling of analog and digital SRAM in-memory computing architectures

14. SALSA: Simulated Annealing based Loop-Ordering Scheduler for DNN Accelerators

15. NeuroBench: A Framework for Benchmarking Neuromorphic Computing Algorithms and Systems

16. Real-Time Acoustic Perception for Automotive Applications

17. TinyVers: A Tiny Versatile System-on-chip with State-Retentive eMRAM for ML Inference at the Extreme Edge

18. Towards Heterogeneous Multi-core Accelerators Exploiting Fine-grained Scheduling of Layer-Fused Deep Neural Networks

19. DeFiNES: Enabling Fast Exploration of the Depth-first Scheduling Space for DNN Accelerators through Analytical Modeling

20. DPU-v2: Energy-efficient execution of irregular directed acyclic graphs

21. Hardware-aware mobile building block evaluation for computer vision

22. Delta Keyword Transformer: Bringing Transformers to the Edge through Dynamically Pruned Multi-Head Self-Attention

23. DPU: DAG Processing Unit for Irregular Graphs with Precision-Scalable Posit Arithmetic in 28nm

24. Taxonomy and Benchmarking of Precision-Scalable MAC Arrays Under Enhanced DNN Dataflow Representation

25. GRAPHOPT: constrained-optimization-based parallelization of irregular graphs

26. Acceleration of probabilistic reasoning through custom processor architecture

27. ProbLP: A framework for low-precision probabilistic inference

28. Feed-Forward On-Edge Fine-tuning Using Static Synthetic Gradient Modules

29. ZigZag: A Memory-Centric Rapid DNN Accelerator Design Space Exploration Framework

30. Benchmarking TinyML Systems: Challenges and Direction

31. A multi-layered energy consumption model for smart wireless acoustic sensor networks

32. BinarEye: An Always-On Energy-Accuracy-Scalable Binary CNN Processor With All Memory On Chip in 28nm CMOS

33. Resource aware design of a deep convolutional-recurrent neural network for speech recognition through audio-visual sensor fusion

34. Minimum Energy Quantized Neural Networks

35. A 0.3-2.6 TOPS/W Precision-Scalable Processor for Real-Time Large-Scale ConvNets

36. Energy-Efficient ConvNets Through Approximate Computing

37. Understanding interdependency through complex information sharing

Catalog

Books, media, physical & digital resources