weifengliu-ssslab / Benchmark_SpMV_using_CSRLinks
CSR-based SpMV on Heterogeneous Processors (Intel Broadwell, AMD Kaveri and nVidia Tegra K1)
☆27Updated 10 years ago
Alternatives and similar repositories for Benchmark_SpMV_using_CSR
Users that are interested in Benchmark_SpMV_using_CSR are comparing it to the libraries listed below
Sorting:
- A Synchronization-Free Algorithm for Parallel Sparse Triangular Solves (SpTRSV)☆22Updated 5 years ago
- CSR5-based SpMV on CPUs, GPUs and Xeon Phi☆105Updated last year
- CSR-based SpGEMM on nVidia and AMD GPUs☆46Updated 9 years ago
- ☆93Updated 8 years ago
- Third party assembler and GEMM library for NVIDIA Kepler GPU☆81Updated 5 years ago
- This package includes the implementation for four sparse linear algebra kernels: Sparse-Matrix-Vector-Multiplication (SpMV), Sparse-Trian…☆26Updated 5 years ago
- Fast Synchronization-Free Algorithms for Parallel Sparse Triangular Solves with Multiple Right-Hand Sides (SpTRSM)☆12Updated 5 years ago
- Parallelized and vectorized SpMV on Intel Xeon Phi (Knights Landing, AVX512, KNL)☆24Updated last year
- LonestarGPU: Irregular algorithms parallelized for GPUs☆35Updated 5 years ago
- Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018☆72Updated 4 years ago
- Sparse matrix computation library for GPU☆56Updated 5 years ago
- Efficient SpGEMM on GPU using CUDA and CSR☆56Updated last year
- Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs☆16Updated 6 years ago
- The SHOC Benchmark Suite☆256Updated 3 years ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆106Updated 7 years ago
- sparse matrix pre-processing library☆83Updated last year
- A GPU algorithm for sparse matrix-matrix multiplication☆71Updated 4 years ago
- ☆18Updated last year
- A sparse BLAS lib supporting multiple backends☆43Updated 4 months ago
- Flexible GPGPU instrumentation☆88Updated 5 years ago
- Asynchronous Multi-GPU Programming Framework☆46Updated 4 years ago
- Implementation of TSM2L and TSM2R -- High-Performance Tall-and-Skinny Matrix-Matrix Multiplication Algorithms for CUDA☆33Updated 4 years ago
- Implementation and analysis of five different GPU based SPMV algorithms in CUDA☆41Updated 6 years ago
- ☆51Updated 6 years ago
- Subpart source code of of deepcore v0.7☆27Updated 5 years ago
- Medusa: Building GPU-based Parallel Sparse Graph Applications with Sequential C/C++ Code☆62Updated 4 years ago
- Source code of the PPoPP '22 paper: "TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs" by Y…☆40Updated last year
- ☆248Updated last month
- Multi-GPU Computing Benchmark Suite (CUDA)☆42Updated 8 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆246Updated this week