Search

Showing total 135 results
135 results

Search Results

1. Cache-Aided Matrix Multiplication Retrieval.

2. An Interpretive Structural Analysis for Industry 4.0 Adoption Challenges.

3. Matrix Function Optimization Problems Under Orthonormal Constraint.

4. Straggler Mitigation in Distributed Matrix Multiplication: Fundamental Limits and Optimal Coding.

5. A Unifying Framework to Construct QC-LDPC Tanner Graphs of Desired Girth.

6. Rook Coding for Batch Matrix Multiplication.

7. Dissecting Tensor Cores via Microbenchmarks: Latency, Throughput and Numeric Behaviors.

8. EGCN: An Efficient GCN Accelerator for Minimizing Off-Chip Memory Access.

9. Memory-Efficient Deformable Convolution Based Joint Denoising and Demosaicing for UHD Images.

10. SOT-MRAM Digital PIM Architecture With Extended Parallelism in Matrix Multiplication.

11. Efficient Adaptive Online Learning via Frequent Directions.

12. Propagating Uncertainty in Power System Initial Conditions Using Data-Driven Linear Operators.

13. Low-Complexity Block Coordinate Descend Based Multiuser Detection for Uplink Grant-Free NOMA.

14. An Efficient Spatial Covariance Matrix Reconstruction Algorithm in the Hybrid Analog-Digital Structure.

15. Hierarchical Coded Matrix Multiplication in Heterogeneous Multihop Networks.

16. Multi-Objective Matrix Normalization for Fine-Grained Visual Recognition.

17. Efficient Job Offloading in Heterogeneous Systems Through Hardware-Assisted Packet-Based Dispatching and User-Level Runtime Infrastructure.

18. On the Solvability of Feedback Complete Linearization of Nonlinear Stochastic Systems.

19. Scalar MSCR Codes via the Product Matrix Construction.

20. Improving Efficiency of Parallel Vertex-Centric Algorithms for Irregular Graphs.

21. On Decoding Binary Quasi-Reversible BCH Codes.

22. The Unicorn Runtime: Efficient Distributed Shared Memory Programming for Hybrid CPU-GPU Clusters.

23. Label Propagated Nonnegative Matrix Factorization for Clustering.

24. Coded Computation Over Heterogeneous Clusters.

25. Improved Constructions for Secure Multi-Party Batch Matrix Multiplication.

26. CodedSketch: A Coding Scheme for Distributed Computation of Approximated Matrix Multiplication.

27. Co-Clustering Ensembles Based on Multiple Relevance Measures.

28. Coded Computing and Cooperative Transmission for Wireless Distributed Matrix Multiplication.

29. Hierarchical Coded Matrix Multiplication.

30. Data Encoding for Byzantine-Resilient Distributed Optimization.

31. X-Secure T-Private Information Retrieval From MDS Coded Storage With Byzantine and Unresponsive Servers.

32. Forward-Inverse 2D Hardware Implementation of Approximate Transform Core for the VVC Standard.

33. Multi-Kernel Polar Codes: Concept and Design Principles.

34. On the Capacity of MIMO Optical Wireless Channels.

35. Efficient Hardware for Generalized Turbo Signal Recovery in Compressed Sensing.

36. Design and Analysis of Area and Power Efficient Approximate Booth Multipliers.

37. HitGraph: High-throughput Graph Processing Framework on FPGA.

38. Flexible Distributed Matrix Multiplication.

39. Analyzing and Increasing the Reliability of Convolutional Neural Networks on GPUs.

40. Efficient Implementations of Reduced Precision Redundancy (RPR) Multiply and Accumulate (MAC).

41. On the Restricted Isometry of the Columnwise Khatri--Rao Product.

42. Optimization of Constant Matrix Multiplication with Low Power and High Throughput.

43. Codesign Tradeoffs for High-Performance, Low-Power Linear Algebra Architectures.

44. Adaptive SpMV/SpMSpV on GPUs for Input Vectors of Varied Sparsity.

45. Sparse Matrix Multiplication On An Associative Processor.

46. Autotuning GEMM Kernels for the Fermi GPU.

47. 32-Bit 4 × 4 Bit-Slice RSFQ Matrix Multiplier.

48. Beyond the Roofline: Cache-Aware Power and Energy-Efficiency Modeling for Multi-Cores.

49. Exploring the Intrinsic Features of EEG Signals via Empirical Mode Decomposition for Depression Recognition.

50. ReHy: A ReRAM-Based Digital/Analog Hybrid PIM Architecture for Accelerating CNN Training.