120 results on '"Hager, Georg"'
Search Results
2. MD-Bench: A performance-focused prototyping harness for state-of-the-art short-range molecular dynamics algorithms
3. Physical Oscillator Model for Supercomputing
4. SPEChpc 2021 Benchmarks on Ice Lake and Sapphire Rapids Infiniband Clusters: A Performance and Energy Case Study
5. Making applications faster by asynchronous execution: Slowing down processes or relaxing MPI collectives
6. Analytic Modeling of Idle Waves in Parallel Programs: Communication, Cluster Topology, and Noise Impact
7. Orthogonal Layers of Parallelism in Large-Scale Eigenvalue Computations
8. ESSEX: Equipping Sparse Solvers For Exascale
9. Desynchronization and Wave Pattern Formation in MPI-Parallel and Hybrid Memory-Bound Programs
10. Understanding HPC Benchmark Performance on Intel Broadwell and Cascade Lake Processors
11. Performance Engineering for a Tall & Skinny Matrix Multiplication Kernels on GPUs
12. Understanding HPC Benchmark Performance on Intel Broadwell and Cascade Lake Processors
13. Application Knowledge Required: Performance Modeling for Fun and Profit
14. Analytical performance estimation during code generation on modern GPUs
15. The Role of Idle Waves, Desynchronization, and Bottleneck Evasion in the Performance of Parallel Programs
16. Level-Based Blocking for Sparse Matrices: Sparse Matrix-Power-Vector Multiplication
17. On the Accuracy and Usefulness of Analytic Energy Models for Contemporary Multicore Processors
18. Chebyshev Filter Diagonalization on Modern Manycore Processors and GPGPUs
19. Kerncraft: A Tool for Analytic Performance Modeling of Loop Kernels
20. Improved Coefficients for Polynomial Filtering in ESSEX
21. An Analysis of Core- and Chip-Level Architectural Features in Four Generations of Intel Server Processors
22. Addressing White-box Modeling and Simulation Challenges in Parallel Computing
23. Towards an Exascale Enabled Sparse Solver Repository
24. Analysis of Intel’s Haswell Microarchitecture Using the ECM Model and Microbenchmarks
25. Validation of Hardware Events for Successful Performance Pattern Identification in High Performance Computing
26. Performance Engineering and Energy Efficiency of Building Blocks for Large, Sparse Eigenvalue Computations on Heterogeneous Supercomputers
27. Performance Analysis of the Kahan-Enhanced Scalar Product on Current Multicore Processors
28. Analytic performance model for parallel overlapping memory‐bound kernels
29. Execution‐Cache‐Memory modeling and performance tuning of sparse matrix‐vector multiplication and Lattice quantum chromodynamics on A64FX
30. ESSEX: Equipping Sparse Solvers for Exascale
31. YaskSite: Stencil Optimization Techniques Applied to Explicit ODE Methods on Modern Architectures
32. Performance Patterns and Hardware Metrics on Modern Multicore Processors: Best Practices for Performance Engineering
33. Performance Engineering: From Numbers to Insight
34. Performance Modeling of Streaming Kernels and Sparse Matrix-Vector Multiplication on A64FX
35. MPC and Coarray Fortran: Alternatives to Classic MPI Implementations on the Examples of Scalable Lattice Boltzmann Flow Solvers
36. likwid-bench: An Extensible Microbenchmarking Platform for x86 Multicore Compute Nodes
37. LIKWID: Lightweight Performance Tools
38. Introduction to High Performance Computing for Scientists and Engineers
39. Complexities of Performance Prediction for Bandwidth-Limited Loop Kernels on Multi-Core Architectures
40. Luttinger, Peierls or Mott? Quantum Phase Transitions in Strongly Correlated 1D Electron–Phonon Systems
41. Performance Limitations for Sparse Matrix-Vector Multiplications on Current Multi-Core Environments
42. Introducing a Performance Model for Bandwidth-Limited Loop Kernels
43. Performance engineering for real and complex tall & skinny matrix multiplication kernels on GPUs
44. A domain-specific language and matrix-free stencil code for investigating electronic properties of Dirac and topological materials
45. A Recursive Algebraic Coloring Technique for Hardware-efficient Symmetric Sparse Matrix-vector Multiplication
46. Analytic performance modeling and analysis of detailed neuron simulations
47. Automatic Throughput and Critical Path Analysis of x86 and ARM Assembly Kernels
48. Hybrid MPI and OpenMP Parallel Programming
49. Fast Sparse Matrix-Vector Multiplication for TeraFlop/s Computers
50. One-Dimensional Electron-Phonon Systems: Mott- Versus Peierls-Insulators
Catalog
Books, media, physical & digital resources
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.