PASSIONLab/MaskedSpGEMM

☆10

Alternatives and similar repositories for MaskedSpGEMM

Users interested in MaskedSpGEMM are comparing it to the libraries listed below.

SuperScientificSoftwareLaboratory / TileSpMV
Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang…
☆13 · Aug 12, 2022 · Updated 3 years ago
monkey2000 / spv8-public
SpV8 is a SpMV kernel written in AVX-512. Artifact for our SpV8 paper @ DAC '21.
☆29 · Mar 16, 2021 · Updated 5 years ago
hpcde / spmv-acc
HIP acceleration of SpMV solver
☆13 · May 17, 2025 · Updated last year
marcsous / gpuSparse
Matlab mex wrappers to cuSPARSE (NVIDIA)
☆11 · Dec 10, 2025 · Updated 7 months ago
PASSIONLab / CombBLAS
The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear …
☆82 · Jun 4, 2026 · Updated last month
Algebraic-Programming / ALP
Home of ALP/GraphBLAS and ALP/Pregel, featuring shared- and distributed-memory auto-parallelisation of linear algebraic and vertex-centri…
☆33 · Apr 2, 2026 · Updated 3 months ago
SuperScientificSoftwareLaboratory / TileSpGEMM
Source code of the PPoPP '22 paper: "TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs" by Y…
☆48 · May 22, 2024 · Updated 2 years ago
weifengliu-ssslab / Benchmark_SpGEMM_using_CSR
CSR-based SpGEMM on NVIDIA and AMD GPUs
☆48 · Apr 9, 2016 · Updated 10 years ago
poojahira / spmv-cuda
Implementation and analysis of five different GPU-based SpMV algorithms in CUDA
☆39 · Feb 5, 2019 · Updated 7 years ago
oresths / tSparse
A GPU algorithm for sparse matrix-matrix multiplication
☆74 · Oct 1, 2020 · Updated 5 years ago
cslab-ntua / artificial-matrix-generator
An artificial matrix generator in C
☆13 · Feb 16, 2023 · Updated 3 years ago
Xilinx / SME-Developer-Labs
☆14 · Sep 29, 2018 · Updated 7 years ago
CRAFT-THU / RoDe
A Row Decomposition-based Approach for Sparse Matrix Multiplication on GPUs
☆30 · Nov 29, 2023 · Updated 2 years ago
temporal-hpc / reduction-tensor-cores
Fast GPU-based tensor core reductions
☆12 · Jan 13, 2023 · Updated 3 years ago
RRZE-HPC / GHOST
General, Hybrid and Optimized Sparse Toolkit (Bitbucket mirror)
☆12 · Apr 8, 2021 · Updated 5 years ago
GPUPeople / ACSpGEMM
Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU"
☆31 · Jul 7, 2020 · Updated 6 years ago
AnonymousRepo123 / AlphaSparse
An intelligent matrix format designer for SpMV
☆10 · Oct 10, 2023 · Updated 2 years ago
misa-kmc / misa-akmc
KMC simulation of vacancy-dumbbell transition for BCC lattice.
☆13 · Aug 20, 2025 · Updated 11 months ago
mean9park / BitFusion-verilog
BitFusion Verilog implementation
☆13 · Feb 21, 2022 · Updated 4 years ago
weifengliu-ssslab / Benchmark_SpMV_using_CSR5
CSR5-based SpMV on CPUs, GPUs and Xeon Phi
☆111 · Jun 10, 2024 · Updated 2 years ago
tsinghua-ideal / spada-sim
The simulator for SPADA, an SpGEMM accelerator with adaptive dataflow
☆47 · Jan 26, 2023 · Updated 3 years ago
Ivanrs297 / cuda-spmv-csr
Parallel SpMV using CSR representation, built in CUDA
☆14 · Jun 27, 2020 · Updated 6 years ago
apuaaChen / vectorSparse
☆32 · Aug 24, 2022 · Updated 3 years ago
GPUPeople / spECK
Efficient SpGEMM on GPU using CUDA and CSR
☆61 · Jul 18, 2023 · Updated 3 years ago
Luca-Dalmasso / matrixTransposeCUDA
Simple CUDA C application for NVIDIA GPUs
☆11 · Jun 7, 2022 · Updated 4 years ago
Bruce-Lee-LY / memory_pool
A simple and efficient memory pool implemented in C++11.
☆10 · Jun 2, 2022 · Updated 4 years ago
guoyang9 / PELA
PELA: Learning Parameter-Efficient Models with Low-Rank Approximation [CVPR 2024]
☆19 · Apr 14, 2024 · Updated 2 years ago
wjc404 / GEMM_AVX512F
SGEMM and DGEMM subroutines using AVX512F instructions.
☆15 · May 22, 2022 · Updated 4 years ago
li199603 / parallel_prefix_sum
Parallel Prefix Sum (Scan) with CUDA
☆30 · Jun 22, 2024 · Updated 2 years ago
pku-liang / Sanger
A co-design architecture on sparse attention
☆55 · Aug 23, 2021 · Updated 4 years ago
Zhu-ZiXuan / Bitlet-PE
A bit-level sparsity-aware multiply-accumulate processing element.
☆19 · Jul 9, 2024 · Updated 2 years ago
danghvu / cudaSpmv
CUDA Sparse-Matrix Vector Multiplication, using Sliced Coordinate format
☆22 · Jun 8, 2018 · Updated 8 years ago
philipl / nv-video-info
Utilities to print information about video encode/decode capabilities of NVIDIA GPUs
☆35 · Jun 25, 2026 · Updated 3 weeks ago
XiaosongAI / Parallel-SpMV
Parallel optimization algorithms for sparse matrix-vector multiplication (OpenMP, AVX)
☆11 · Jul 7, 2021 · Updated 5 years ago
jacobaustin123 / Python-C-API-CUDA-Tutorial
A tutorial/example of the Python C-API and integration with CUDA kernels.
☆14 · Jul 7, 2019 · Updated 7 years ago
aneesh297 / Sparse-Matrix-Vector-Multiplication
SpMV using CUDA
☆20 · Mar 5, 2018 · Updated 8 years ago
PASSIONLab / distributed_sddmm
Distributed SDDMM Kernel
☆12 · Jul 8, 2022 · Updated 4 years ago
owensgroup / merge-spmm
Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018
☆73 · Oct 5, 2020 · Updated 5 years ago
ChengZhang-98 / LQER
Official implementation of ICML'24 paper "LQER: Low-Rank Quantization Error Reconstruction for LLMs"
☆19 · Jul 11, 2024 · Updated 2 years ago