hpdps-group / hipSZLinks
A portable implementation of SZ lossy compression for AMD GPUs and Hygon DCUs.
β10Updated 10 months ago
Alternatives and similar repositories for hipSZ
Users that are interested in hipSZ are comparing it to the libraries listed below
Sorting:
- A GPU benchmark suite for assessing on-chip GPU memory bandwidthβ109Updated 8 years ago
- π GPU load-balancing library for regular and irregular computations.β64Updated 3 months ago
- β20Updated 6 years ago
- development repository for the open earth compilerβ81Updated 4 years ago
- β61Updated last year
- GPU Performance Advisorβ65Updated 3 years ago
- CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernelsβ32Updated 4 years ago
- A tool for generating information about the matrix multiplication instructions in AMD Radeonβ’ and AMD Instinctβ’ acceleratorsβ124Updated last month
- Machine Intelligence Shader Autogen. AMDGPU ML shader code generator. (previously iGEMMgen)β36Updated 5 months ago
- Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018β73Updated 5 years ago
- An extension library of WMMA API (Tensor Core API)β109Updated last year
- A Benchmark Suite for Heterogeneous System Computationβ54Updated 10 months ago
- Assembler for NVIDIA Volta and Turing GPUsβ236Updated 3 years ago
- Efficient SpGEMM on GPU using CUDA and CSRβ59Updated 2 years ago
- A GPU accelerated error-bounded lossy compression for scientific data.β93Updated this week
- Benchmark for measuring the performance of sparse and irregular memory access.β82Updated 4 months ago
- Dissecting NVIDIA GPU Architectureβ115Updated 3 years ago
- β50Updated 6 years ago
- β293Updated 3 months ago
- β47Updated 5 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repoβ253Updated 2 weeks ago
- Third party assembler and GEMM library for NVIDIA Kepler GPUβ85Updated 6 years ago
- CSR-based SpGEMM on nVidia and AMD GPUsβ46Updated 9 years ago
- β161Updated this week
- portDNN is a library implementing neural network algorithms written using SYCLβ113Updated last year
- A GPU algorithm for sparse matrix-matrix multiplicationβ73Updated 5 years ago
- [DEPRECATED] Moved to ROCm/rocm-systems repoβ165Updated this week
- Chaiβ47Updated last month
- TLB Benchmarksβ35Updated 8 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repoβ178Updated last week