Search

Your search keyword '"D.1.3"' showing total 1,366 results

Search Constraints

Start Over You searched for: Descriptor "D.1.3" Remove constraint Descriptor: "D.1.3"
1,366 results on '"D.1.3"'

Search Results

1. Assembly of FETI dual operator using CUDA

2. Optimizing Fine-Grained Parallelism Through Dynamic Load Balancing on Multi-Socket Many-Core Systems

3. Complementing an imperative process algebra with a rely/guarantee logic

4. Work-Efficient Parallel Non-Maximum Suppression Kernels

5. Static Batching of Irregular Workloads on GPUs: Framework and Application to Efficient MoE Model Inference

6. FedAlign: Federated Domain Generalization with Cross-Client Feature Alignment

7. Communication-Efficient, 2D Parallel Stochastic Gradient Descent for Distributed-Memory Optimization

8. The B2Scala Tool: Integrating Bach in Scala with Security in Mind

9. A Gentle Overview of Asynchronous Session-based Concurrency: Deadlock Freedom by Typing

10. Tensor-product vertex patch smoothers for biharmonic problems

11. Cascaded Prediction and Asynchronous Execution of Iterative Algorithms on Heterogeneous Platforms

12. Precision-Aware Iterative Algorithms Based on Group-Shared Exponents of Floating-Point Numbers

13. An Evaluation of Massively Parallel Algorithms for DFA Minimization

14. Final Report for CHESS: Cloud, High-Performance Computing, and Edge for Science and Security

15. A Study of Performance Portability in Plasma Physics Simulations

16. MATCH: Model-Aware TVM-based Compilation for Heterogeneous Edge Devices

17. Agent-based modeling for realistic reproduction of human mobility and contact behavior to evaluate test and isolation strategies in epidemic infectious disease spread

18. FedStein: Enhancing Multi-Domain Federated Learning Through James-Stein Estimator

19. FLeNS: Federated Learning with Enhanced Nesterov-Newton Sketch

20. Handling expression evaluation under interference

21. DNA sequence alignment: An assignment for OpenMP, MPI, and CUDA/OpenCL

22. Conversational Concurrency

23. Challenging Portability Paradigms: FPGA Acceleration Using SYCL and OpenCL

24. Stream parallel skeleton optimization

25. Solving Large Rank-Deficient Linear Least-Squares Problems on Shared-Memory CPU Architectures and GPU Architectures

26. Enabling Practical Transparent Checkpointing for MPI: A Topological Sort Approach

27. Parallel Strategies for Best-First Generalized Planning

28. Scalable Dual Coordinate Descent for Kernel Methods

29. Vahana.jl -- A framework (not only) for large-scale agent-based models

30. Stencil Computations on AMD and Nvidia Graphics Processors: Performance and Tuning Strategies

31. Porting the grid-based 3D+3V hybrid-Vlasov kinetic plasma simulation Vlasiator to heterogeneous GPU architectures

32. Construction of a Byzantine Linearizable SWMR Atomic Register from SWSR Atomic Registers

33. Local Adjoints for Simultaneous Preaccumulations with Shared Inputs

34. Restructuring a concurrent refinement algebra

35. Data reification in a concurrent rely-guarantee algebra

36. Hybrid parallel discrete adjoints in SU2

37. A Systematic Literature Survey of Sparse Matrix-Vector Multiplication

38. How to Relax Instantly: Elastic Relaxation of Concurrent Data Structures

39. Reasoning about distributive laws in a concurrent refinement algebra

40. Rhizomes and Diffusions for Processing Highly Skewed Graphs on Fine-Grain Message-Driven Systems

41. Exploring the Design Space for Message-Driven Systems for Dynamic Graph Processing using CCA

42. Programming Distributed Collective Processes in the eXchange Calculus

43. On the relativistic viability of multi-automaton systems: essential concepts, challenges and prospects

44. Report of the DOE/NSF Workshop on Correctness in Scientific Computing, June 2023, Orlando, FL

45. FULL-W2V: Fully Exploiting Data Reuse for W2V on GPU-Accelerated Systems

46. High Performance Multiple Sequence Alignment Algorithms for Comparison of Microbial Genomes

47. Computing the k-th Eigenvalue of Symmetric $H^2$-Matrices

48. Efficient Algorithms for Monte Carlo Particle Transport on AI Accelerator Hardware

49. A Performance-Portable SYCL Implementation of CRK-HACC for Exascale

50. Compiler Testing With Relaxed Memory Models

Catalog

Books, media, physical & digital resources