ParCIS/Magicube

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ParCIS/Magicube)

ParCIS / Magicube

Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.

☆92

Alternatives and similar repositories for Magicube

Users that are interested in Magicube are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LucasWilkinson / ASpT-mirror
View on GitHub
Mirror of http://gitlab.hpcrl.cse.ohio-state.edu/chong/ppopp19_ae, refactoring for understanding
☆17Oct 20, 2021Updated 4 years ago
apuaaChen / vectorSparse
View on GitHub
☆32Aug 24, 2022Updated 3 years ago
YukeWang96 / TC-GNN_ATC23
View on GitHub
Artifact for USENIX ATC'23: TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs.
☆58Oct 16, 2023Updated 2 years ago
google-research / sputnik
View on GitHub
A library of GPU kernels for sparse matrix operations.
☆289Nov 24, 2020Updated 5 years ago
oresths / tSparse
View on GitHub
A GPU algorithm for sparse matrix-matrix multiplication
☆74Oct 1, 2020Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
YusukeNagasaka / Batched-SpMM
View on GitHub
New batched algorithm for sparse matrix-matrix multiplication (SpMM)
☆16May 7, 2019Updated 7 years ago
HPMLL / DTC-SpMM_ASPLOS24
View on GitHub
☆47Jun 19, 2024Updated 2 years ago
hgyhungry / ShflBW_Sparse_NN
View on GitHub
☆16Nov 22, 2022Updated 3 years ago
hgyhungry / ge-spmm
View on GitHub
☆115Jul 3, 2021Updated 5 years ago
owensgroup / merge-spmm
View on GitHub
Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018
☆73Oct 5, 2020Updated 5 years ago
ParCIS / FlashSparse
View on GitHub
FlashSparse significantly reduces the computation redundancy for unstructured sparsity (for SpMM and SDDMM) on Tensor Cores through a Swa…
☆39Oct 5, 2025Updated 9 months ago
uwsampl / SparseTIR
View on GitHub
SparseTIR: Sparse Tensor Compiler for Deep Learning
☆145Mar 31, 2023Updated 3 years ago
microsoft / ConvStencil
View on GitHub
☆37Apr 10, 2024Updated 2 years ago
SpRegTiling / sparse-register-tiling
View on GitHub
☆10Mar 2, 2024Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
YukeWang96 / GNNAdvisor_OSDI21
View on GitHub
Artifact for OSDI'21 GNNAdvisor: An Adaptive and Efficient Runtime System for GNN Acceleration on GPUs.
☆71Mar 2, 2023Updated 3 years ago
microsoft / SparTA
View on GitHub
☆167Jul 22, 2024Updated last year
weifengliu-ssslab / Benchmark_SpTRSM_using_CSC
View on GitHub
Fast Synchronization-Free Algorithms for Parallel Sparse Triangular Solves with Multiple Right-Hand Sides (SpTRSM)
☆17Feb 14, 2020Updated 6 years ago
ceruleangu / Block-Sparse-Benchmark
View on GitHub
Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse.
☆23Aug 21, 2020Updated 5 years ago
vtsynergy / bb_segsort
View on GitHub
☆21Aug 21, 2023Updated 2 years ago
nullplay / Unified-Convolution-Framework
View on GitHub
☆10Apr 24, 2023Updated 3 years ago
ParCIS / Ok-Topk
View on GitHub
Ok-Topk is a scheme for distributed training with sparse gradients. Ok-Topk integrates a novel sparse allreduce algorithm (less than 6k c…
☆27Dec 10, 2022Updated 3 years ago
xiezhq-hermann / graphiler
View on GitHub
Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef…
☆59Oct 3, 2022Updated 3 years ago
c3sr / tcu_scope
View on GitHub
☆50Jun 27, 2019Updated 7 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
dgSPARSE / dgSPARSE-Lib
View on GitHub
PyTorch-Based Fast and Efficient Processing for Various Machine Learning Applications with Diverse Sparsity
☆122Jul 13, 2026Updated last week
KnowingNothing / MatmulTutorial
View on GitHub
A Easy-to-understand TensorOp Matmul Tutorial
☆445Mar 5, 2026Updated 4 months ago
HipGraph / FusedMM
View on GitHub
Implementation of FusedMM method for IPDPS 2021 paper titled "FusedMM: A Unified SDDMM-SpMM Kernel for Graph Embedding and Graph Neural N…
☆31Aug 12, 2022Updated 3 years ago
hku-systems / naspipe
View on GitHub
☆14Jan 12, 2022Updated 4 years ago
uwsampl / sparsetir-artifact
View on GitHub
Repository for artifact evaluation of ASPLOS 2023 paper "SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning"
☆25Feb 24, 2023Updated 3 years ago
weifengliu-ssslab / Benchmark_SpMV_using_CSR5
View on GitHub
CSR5-based SpMV on CPUs, GPUs and Xeon Phi
☆111Jun 10, 2024Updated 2 years ago
araij / rabbit_order
View on GitHub
☆49Jan 30, 2026Updated 5 months ago
SC-Tech-Program / Author-Kit
View on GitHub
Instructions and templates for SC authors
☆17Aug 22, 2021Updated 4 years ago
SuperScientificSoftwareLaboratory / TileSpMV
View on GitHub
Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang…
☆13Aug 12, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
weifengliu-ssslab / Benchmark_SpTRSV_using_CSC
View on GitHub
A Synchronization-Free Algorithm for Parallel Sparse Triangular Solves (SpTRSV)
☆23Feb 14, 2020Updated 6 years ago
YukeWang96 / MGG_OSDI23
View on GitHub
Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult…
☆40Mar 17, 2024Updated 2 years ago
SuperScientificSoftwareLaboratory / TileSpGEMM
View on GitHub
Source code of the PPoPP '22 paper: "TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs" by Y…
☆48May 22, 2024Updated 2 years ago
amazon-science / FeatGraph
View on GitHub
☆69Jun 16, 2021Updated 5 years ago
ustcadsl / GraphWalker
View on GitHub
☆19Jul 1, 2020Updated 6 years ago
CRAFT-THU / RoDe
View on GitHub
A Row Decomposition-based Approach for Sparse Matrix Multiplication on GPUs
☆30Nov 29, 2023Updated 2 years ago
north-numerical-computing / tensor-cores-numerical-behavior
View on GitHub
Test suite for probing the numerical behavior of NVIDIA tensor cores
☆42Jul 24, 2024Updated last year