CRAFT-THU/RoDe

A Row Decomposition-based Approach for Sparse Matrix Multiplication on GPUs

☆30

Alternatives and similar repositories for RoDe

Users who are interested in RoDe often compare it to the libraries listed below.

AnonymousRepo123 / AlphaSparse
An intelligent matrix format designer for SpMV
☆10 · Oct 10, 2023 · Updated 2 years ago
hgyhungry / ShflBW_Sparse_NN
☆16 · Nov 22, 2022 · Updated 3 years ago
LucasWilkinson / ASpT-mirror
Mirror of http://gitlab.hpcrl.cse.ohio-state.edu/chong/ppopp19_ae, refactoring for understanding
☆17 · Oct 20, 2021 · Updated 4 years ago
SuperScientificSoftwareLaboratory / TileSpMV
Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang…
☆13 · Aug 12, 2022 · Updated 3 years ago
luuhwy / VNEC
VNEC: A Vectorized Non-Empty Column Format for SpMV on cross-platform multicore CPUs
☆10 · Feb 6, 2024 · Updated 2 years ago
ParCIS / FlashSparse
FlashSparse significantly reduces the computation redundancy for unstructured sparsity (for SpMM and SDDMM) on Tensor Cores through a Swa…
☆39 · Oct 5, 2025 · Updated 9 months ago
spcl / smat
Code for High Performance Unstructured SpMM Computation Using Tensor Cores
☆35 · Nov 3, 2024 · Updated last year
dgSPARSE / dgSPARSE-Lib
PyTorch-Based Fast and Efficient Processing for Various Machine Learning Applications with Diverse Sparsity
☆122 · Jul 13, 2026 · Updated last week
GPUPeople / spECK
Efficient SpGEMM on GPU using CUDA and CSR
☆61 · Jul 18, 2023 · Updated 3 years ago
google-research / sputnik
A library of GPU kernels for sparse matrix operations.
☆289 · Nov 24, 2020 · Updated 5 years ago
Ivanrs297 / cuda-spmv-csr
Parallel SpMV using CSR representation, built in CUDA
☆14 · Jun 27, 2020 · Updated 6 years ago
Hyaloid / AccSpMM
Official implementation of Acc-SpMM: Accelerating General-purpose Sparse Matrix-Matrix Multiplication with GPU Tensor Cores.
☆17 · Nov 13, 2025 · Updated 8 months ago
apuaaChen / vectorSparse
☆32 · Aug 24, 2022 · Updated 3 years ago
weifengliu-ssslab / Benchmark_SpGEMM_using_CSR
CSR-based SpGEMM on NVIDIA and AMD GPUs
☆48 · Apr 9, 2016 · Updated 10 years ago
PASSIONLab / MaskedSpGEMM
☆10 · Jul 4, 2022 · Updated 4 years ago
HPMLL / DTC-SpMM_ASPLOS24
☆47 · Jun 19, 2024 · Updated 2 years ago
SuperScientificSoftwareLaboratory / TileSpGEMM
Source code of the PPoPP '22 paper: "TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs" by Y…
☆48 · May 22, 2024 · Updated 2 years ago
SuperScientificSoftwareLaboratory / DASP
Source code of the SC '23 paper: "DASP: Specific Dense Matrix Multiply-Accumulate Units Accelerated General Sparse Matrix-Vector Multipli…
☆29 · Jun 18, 2024 · Updated 2 years ago
abhinav-vaishya / Fast-Training-of-Convolutional-Networks-through-FFTs
Implementation of the paper - Fast Training of Convolutional Networks through FFTs (CUDA for parallelization)
☆10 · May 8, 2020 · Updated 6 years ago
UDC-GAC / openCNN
A Winograd Minimal Filter Implementation in CUDA
☆31 · Aug 25, 2021 · Updated 4 years ago
YukeWang96 / TC-GNN_ATC23
Artifact for USENIX ATC'23: TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs.
☆58 · Oct 16, 2023 · Updated 2 years ago
HPMLL / SpInfer_EuroSys25
☆35 · Apr 2, 2025 · Updated last year
weifengliu-ssslab / Benchmark_SpMV_using_CSR5
CSR5-based SpMV on CPUs, GPUs and Xeon Phi
☆111 · Jun 10, 2024 · Updated 2 years ago
uwsampl / sparsetir-artifact
Repository for artifact evaluation of ASPLOS 2023 paper "SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning"
☆25 · Feb 24, 2023 · Updated 3 years ago
alan-hpc / cuda_op_benchmark
An easily extensible framework for understanding and optimizing CUDA operators, intended for learning purposes only
☆18 · Jun 13, 2024 · Updated 2 years ago
Bruce-Lee-LY / cuda_back2back_hgemm
Use Tensor Cores to compute back-to-back HGEMM (half-precision general matrix multiplication) with the MMA PTX instruction.
☆13 · Nov 3, 2023 · Updated 2 years ago
hgyhungry / ge-spmm
☆115 · Jul 3, 2021 · Updated 5 years ago
FPSG-UIUC / teaal-compiler
☆25 · Oct 30, 2024 · Updated last year
cslab-ntua / artificial-matrix-generator
An artificial matrix generator in C
☆13 · Feb 16, 2023 · Updated 3 years ago
owensgroup / merge-spmm
Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018
☆74 · Oct 5, 2020 · Updated 5 years ago
sfu-arch / SPAGHETTI
RTL generator for SpGEMM
☆12 · Feb 2, 2021 · Updated 5 years ago
NeuraChip / neurachip
NeuraChip Accelerator Simulator
☆16 · Apr 26, 2024 · Updated 2 years ago
dumerrill / merge-spmv
☆99 · Feb 10, 2017 · Updated 9 years ago
lixiuhong / implicit_gemm_convolution
☆14 · May 28, 2019 · Updated 7 years ago
tsinghua-ideal / spada-sim
The simulator for SPADA, an SpGEMM accelerator with adaptive dataflow
☆47 · Jan 26, 2023 · Updated 3 years ago
RRZE-HPC / GHOST
General, Hybrid and Optimized Sparse Toolkit (Bitbucket mirror)
☆12 · Apr 8, 2021 · Updated 5 years ago
VITA-Group / Linearity-Grafting
[ICML 2022] "Linearity Grafting: Relaxed Neuron Pruning Helps Certifiable Robustness" by Tianlong Chen*, Huan Zhang*, Zhenyu Zhang, Shiyu…
☆16 · Jun 22, 2022 · Updated 4 years ago
ROCm / rocBLAS-Examples
Examples illustrating usage of the rocBLAS library
☆17 · Aug 12, 2024 · Updated last year
hgyhungry / alcop-artifact
☆25 · Mar 15, 2023 · Updated 3 years ago