AdamBrouwersHarries / cusparse_spmvLinks

Example use of cusparse's spmv routine, with benchmarking/reporting code

☆9

Alternatives and similar repositories for cusparse_spmv

Users that are interested in cusparse_spmv are comparing it to the libraries listed below

Sorting:

SuperScientificSoftwareLaboratory / TileSpMV
Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang…
☆11Updated 2 years ago
pku-liang / FlexTensor
Automatic Schedule Exploration and Optimization Framework for Tensor Computations
☆177Updated 3 years ago
c3sr / tcu_scope
☆51Updated 6 years ago
weifengliu-ssslab / Benchmark_SpMV_using_CSR5
CSR5-based SpMV on CPUs, GPUs and Xeon Phi
☆105Updated last year
weifengliu-ssslab / Benchmark_SpGEMM_using_CSR
CSR-based SpGEMM on nVidia and AMD GPUs
☆46Updated 9 years ago
ParCIS / Magicube
Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.
☆89Updated 2 years ago
apuaaChen / vectorSparse
☆31Updated 2 years ago
codyjrivera / tsm2x-imp
Implementation of TSM2L and TSM2R -- High-Performance Tall-and-Skinny Matrix-Matrix Multiplication Algorithms for CUDA
☆33Updated 4 years ago
ekondis / gpumembench
A GPU benchmark suite for assessing on-chip GPU memory bandwidth
☆106Updated 7 years ago
daadaada / turingas
Assembler for NVIDIA Volta and Turing GPUs
☆224Updated 3 years ago
PAA-NCIC / PPoPP2017_artifact
Third party assembler and GEMM library for NVIDIA Kepler GPU
☆81Updated 5 years ago
lixiuhong / batched_gemm
☆39Updated 5 years ago
daadaada / gas
☆45Updated 4 years ago
pku-liang / AMOS
Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators
☆113Updated 2 years ago
sjfeng1999 / gpu-arch-microbenchmark
Dissecting NVIDIA GPU Architecture
☆101Updated 3 years ago
sunlex0717 / DissectingTensorCores
☆104Updated last year
XiuYuLi / deepcore_source_code
Subpart source code of of deepcore v0.7
☆27Updated 5 years ago
owensgroup / merge-spmm
Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018
☆72Updated 4 years ago
dumerrill / merge-spmv
☆93Updated 8 years ago
StrongSpoon / tvm.schedule
examples for tvm schedule API
☆101Updated 2 years ago
chenxuhao / caffe-escoin
Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs
☆16Updated 6 years ago
nicolaswilde / cuda-tensorcore-hgemm
☆148Updated 6 months ago
GPUPeople / spECK
Efficient SpGEMM on GPU using CUDA and CSR
☆56Updated 2 years ago
LeiWang1999 / tvm_gpu_gemm
play gemm with tvm
☆91Updated 2 years ago
microsoft / ConvStencil
☆31Updated last year
NMSU-PEARL / PPT-GPU
Performance Prediction Toolkit for GPUs
☆37Updated 3 years ago
nicolaswilde / cuda-sgemm
☆67Updated 6 months ago
njuhope / cuda_sgemm
☆113Updated last year
md2z34 / winograd_gpu
GPU implementation of Winograd convolution
☆10Updated 7 years ago
wzsh / wmma_tensorcore_sample
Matrix Multiply-Accumulate with CUDA and WMMA( Tensor Core)
☆138Updated 4 years ago