weifengliu-ssslab/Benchmark_SpGEMM_using_CSR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/weifengliu-ssslab/Benchmark_SpGEMM_using_CSR)

weifengliu-ssslab / Benchmark_SpGEMM_using_CSR

CSR-based SpGEMM on nVidia and AMD GPUs

☆48

Alternatives and similar repositories for Benchmark_SpGEMM_using_CSR

Users that are interested in Benchmark_SpGEMM_using_CSR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

weifengliu-ssslab / Benchmark_SpMV_using_CSR
View on GitHub
CSR-based SpMV on Heterogeneous Processors (Intel Broadwell, AMD Kaveri and nVidia Tegra K1)
☆26May 12, 2015Updated 11 years ago
weifengliu-ssslab / bhSPARSE
View on GitHub
bhSPARSE: A Sparse BLAS Library
☆17Nov 6, 2015Updated 10 years ago
weifengliu-ssslab / Benchmark_SpTRSM_using_CSC
View on GitHub
Fast Synchronization-Free Algorithms for Parallel Sparse Triangular Solves with Multiple Right-Hand Sides (SpTRSM)
☆17Feb 14, 2020Updated 6 years ago
XiaosongAI / Parallel-SpMV
View on GitHub
稀疏矩阵-向量乘的并行优化算法（OpenMP，AVX）
☆11Jul 7, 2021Updated 5 years ago
weifengliu-ssslab / Benchmark_SpMV_using_CSR5
View on GitHub
CSR5-based SpMV on CPUs, GPUs and Xeon Phi
☆111Jun 10, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
SuperScientificSoftwareLaboratory / TileSpGEMM
View on GitHub
Source code of the PPoPP '22 paper: "TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs" by Y…
☆48May 22, 2024Updated 2 years ago
SuperScientificSoftwareLaboratory / TileSpMV
View on GitHub
Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang…
☆13Aug 12, 2022Updated 3 years ago
YusukeNagasaka / Batched-SpMM
View on GitHub
New batched algorithm for sparse matrix-matrix multiplication (SpMM)
☆16May 7, 2019Updated 7 years ago
srkiranraj / spgemm
View on GitHub
Sparse matrix-matrix multiplication on CPU+GPU systems.
☆13Mar 17, 2014Updated 12 years ago
hgyhungry / ge-spmm
View on GitHub
☆115Jul 3, 2021Updated 5 years ago
PASSIONLab / MaskedSpGEMM
View on GitHub
☆10Jul 4, 2022Updated 4 years ago
EBD-CREST / nsparse
View on GitHub
Sparse matrix computation library for GPU
☆59Jul 12, 2020Updated 6 years ago
CRAFT-THU / RoDe
View on GitHub
A Row Decomposition-based Approach for Sparse Matrix Multiplication on GPUs
☆30Nov 29, 2023Updated 2 years ago
GPUPeople / ACSpGEMM
View on GitHub
Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU"
☆31Jul 7, 2020Updated 6 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
cusplibrary / cusplibrary
View on GitHub
CUSP : A C++ Templated Sparse Matrix Library
☆424Jul 8, 2026Updated 2 weeks ago
owensgroup / merge-spmm
View on GitHub
Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018
☆74Oct 5, 2020Updated 5 years ago
Ivanrs297 / cuda-spmv-csr
View on GitHub
Parallel SpMV using CSR representation, built in CUDA
☆14Jun 27, 2020Updated 6 years ago
SuperScientificSoftwareLaboratory / DASP
View on GitHub
Source code of the SC '23 paper: "DASP: Specific Dense Matrix Multiply-Accumulate Units Accelerated General Sparse Matrix-Vector Multipli…
☆29Jun 18, 2024Updated 2 years ago
poojahira / spmv-cuda
View on GitHub
Implementation and analysis of five different GPU based SPMV algorithms in CUDA
☆39Feb 5, 2019Updated 7 years ago
HipGraph / FusedMM
View on GitHub
Implementation of FusedMM method for IPDPS 2021 paper titled "FusedMM: A Unified SDDMM-SpMM Kernel for Graph Embedding and Graph Neural N…
☆31Aug 12, 2022Updated 3 years ago
danghvu / cudaSpmv
View on GitHub
CUDA Sparse-Matrix Vector Multiplication, using Sliced Coordinate format
☆22Jun 8, 2018Updated 8 years ago
BenjaminW3 / matmul
View on GitHub
Sequential and parallel GEMM implementations with C interface + Benchmark.
☆12May 24, 2016Updated 10 years ago
han-shi / SparseBERT
View on GitHub
☆13Nov 25, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
cslab-ntua / SpMV-Research
View on GitHub
☆24Jun 12, 2026Updated last month
cslab-ntua / artificial-matrix-generator
View on GitHub
An artificial matrix generator in C
☆13Feb 16, 2023Updated 3 years ago
pnnl / HiParTI
View on GitHub
☆17Apr 8, 2021Updated 5 years ago
ChenhanYu / hmlp
View on GitHub
High-Performance Machine Learning Primitives
☆13Apr 17, 2021Updated 5 years ago
nulidangxueshen / ALBUS
View on GitHub
A Method for efficiently processing SpMV using SIMD and load balancing
☆17Apr 4, 2022Updated 4 years ago
lixiuhong / implicit_gemm_convolution
View on GitHub
☆14May 28, 2019Updated 7 years ago
ceruleangu / Block-Sparse-Benchmark
View on GitHub
Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse.
☆23Aug 21, 2020Updated 5 years ago
eaymerich / Sparse2015
View on GitHub
Implementation of COO, CSR, CSC, SSS and TJDS sparse matrix formats.
☆11Jul 15, 2015Updated 11 years ago
md2z34 / winograd_gpu
View on GitHub
GPU implementation of Winograd convolution
☆10Oct 23, 2017Updated 8 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
davidrohr / caldgemm
View on GitHub
Portable and Flexible DGEMM Library for GPUs (OpenCL, CUDA, CAL) with special support for HPL
☆16Apr 5, 2018Updated 8 years ago
ParCIS / Ok-Topk
View on GitHub
Ok-Topk is a scheme for distributed training with sparse gradients. Ok-Topk integrates a novel sparse allreduce algorithm (less than 6k c…
☆27Dec 10, 2022Updated 3 years ago
roystgnr / MetaPhysicL
View on GitHub
Metaprogramming and operator-overloaded classes for numerical simulations
☆23Jun 11, 2026Updated last month
fbaru-dev / nbody-demo
View on GitHub
This is an example code based on a simple N-body simulation written in C++ which can be used to demonstrate the functionality of the Inte…
☆18Apr 26, 2021Updated 5 years ago
dumerrill / merge-spmv
View on GitHub
☆99Feb 10, 2017Updated 9 years ago
thesrsakabuvttchi / VLIW
View on GitHub
This is a simple VLIW based processor written in Verilog. A Python script has also been included to simulate static instruction schedulin…
☆18May 14, 2021Updated 5 years ago
alugowski / fast_matrix_market
View on GitHub
Fast and full-featured Matrix Market I/O library for C++, Python, and R
☆91Aug 5, 2024Updated last year