HPMLL/DTC-SpMM_ASPLOS24

☆47

Alternatives and similar repositories for DTC-SpMM_ASPLOS24

Users who are interested in DTC-SpMM_ASPLOS24 often compare it to the repositories listed below.

YukeWang96 / TC-GNN_ATC23
Artifact for USENIX ATC'23: TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs.
☆58 · Oct 16, 2023 · Updated 2 years ago
spcl / smat
Code for High Performance Unstructured SpMM Computation Using Tensor Cores
☆35 · Nov 3, 2024 · Updated last year
apuaaChen / vectorSparse
☆32 · Aug 24, 2022 · Updated 3 years ago
ParCIS / Magicube
Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.
☆92 · Nov 23, 2022 · Updated 3 years ago
xxyux / SpInfer
SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs
☆68 · Mar 25, 2025 · Updated last year
SuperScientificSoftwareLaboratory / TileSpGEMM
Source code of the PPoPP '22 paper: "TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs" by Y…
☆48 · May 22, 2024 · Updated 2 years ago
SuperScientificSoftwareLaboratory / TileSpMV
Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang…
☆13 · Aug 12, 2022 · Updated 3 years ago
UDC-GAC / venom
A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores
☆62 · Nov 24, 2023 · Updated 2 years ago
ParCIS / FlashSparse
FlashSparse significantly reduces the computation redundancy for unstructured sparsity (for SpMM and SDDMM) on Tensor Cores through a Swa…
☆39 · Oct 5, 2025 · Updated 9 months ago
guqiqi / Samoyeds
Samoyeds: Accelerating MoE Models with Structured Sparsity Leveraging Sparse Tensor Cores (EuroSys'25)
☆16 · Jul 17, 2025 · Updated last year
tsinghua-ideal / spada-sim
The simulator for SPADA, an SpGEMM accelerator with adaptive dataflow
☆47 · Jan 26, 2023 · Updated 3 years ago
horizon-research / imagen
☆10 · Mar 8, 2025 · Updated last year
SuperScientificSoftwareLaboratory / DASP
Source code of the SC '23 paper: "DASP: Specific Dense Matrix Multiply-Accumulate Units Accelerated General Sparse Matrix-Vector Multipli…
☆29 · Jun 18, 2024 · Updated 2 years ago
google-research / sputnik
A library of GPU kernels for sparse matrix operations.
☆289 · Nov 24, 2020 · Updated 5 years ago
araij / rabbit_order
☆49 · Jan 30, 2026 · Updated 5 months ago
nullplay / Unified-Convolution-Framework
☆10 · Apr 24, 2023 · Updated 3 years ago
hgyhungry / ge-spmm
☆115 · Jul 3, 2021 · Updated 5 years ago
abhibambhaniya / progressive_gradient_flow_nm_sparsity
Implementation of NM sparsity recipe presented in the paper "Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers".
☆11 · Feb 5, 2024 · Updated 2 years ago
temporal-hpc / reduction-tensor-cores
Fast GPU based tensor core reductions
☆12 · Jan 13, 2023 · Updated 3 years ago
SalesforceAIResearch / LeastLoadedEP
☆18 · Jun 2, 2026 · Updated last month
lenLRX / AmpereSparseMatmul
Study of Ampere's Sparse Matmul
☆18 · Jan 10, 2021 · Updated 5 years ago
YusukeNagasaka / Batched-SpMM
New batched algorithm for sparse matrix-matrix multiplication (SpMM)
☆16 · May 7, 2019 · Updated 7 years ago
HPMLL / SpInfer_EuroSys25
☆35 · Apr 2, 2025 · Updated last year
ModelTC / pyvlova
Yet another Polyhedra Compiler for DeepLearning
☆19 · Apr 14, 2023 · Updated 3 years ago
CRAFT-THU / RoDe
A Row Decomposition-based Approach for Sparse Matrix Multiplication on GPUs
☆30 · Nov 29, 2023 · Updated 2 years ago
UofT-EcoSystem / Minuet
[EuroSys'24] Minuet: Accelerating 3D Sparse Convolutions on GPUs
☆80 · Jun 7, 2024 · Updated 2 years ago
Guangxuan-Xiao / SPMM-CUDA
☆13 · Jun 23, 2022 · Updated 4 years ago
georgia-tech-synergy-lab / SparseAccelerator-RTL
Accelerator RTL inspired by VEGETA [HPCA'23] and MicroScopiQ [ISCA'25]
☆15 · Nov 11, 2025 · Updated 8 months ago
kaitoukito / Integrated-Circuit-Documents
☆13 · Mar 19, 2025 · Updated last year
OpenBitSys / BitDecoding
[HPCA 2026] A GPU-optimized system for efficient long-context LLMs decoding with low-bit KV cache.
☆96 · May 14, 2026 · Updated 2 months ago
rox906 / tcFFT
☆43 · May 21, 2021 · Updated 5 years ago
vtsynergy / bb_segsort
☆21 · Aug 21, 2023 · Updated 2 years ago
oresths / tSparse
A GPU algorithm for sparse matrix-matrix multiplication
☆74 · Oct 1, 2020 · Updated 5 years ago
jiazhihao / sosp19ae
Artifacts for SOSP'19 paper Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions
☆21 · Apr 15, 2022 · Updated 4 years ago
Hazuyuki / PIM-HLS
☆12 · Aug 18, 2023 · Updated 2 years ago
uuudown / SBNN
Singular Binarized Neural Network based on GPU Bit Operations (see our SC-19 paper)
☆17 · Dec 9, 2020 · Updated 5 years ago
owensgroup / merge-spmm
Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018
☆73 · Oct 5, 2020 · Updated 5 years ago
VITA-Group / Structure-LTH
[ICML 2022] "Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets" by Tianlong Chen, Xuxi Chen, Xiaolong Ma, Yanzhi Wa…
☆33 · Apr 9, 2023 · Updated 3 years ago
hgyhungry / ShflBW_Sparse_NN
☆16 · Nov 22, 2022 · Updated 3 years ago