SuperScientificSoftwareLaboratory/DASP

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SuperScientificSoftwareLaboratory/DASP)

SuperScientificSoftwareLaboratory / DASP

Source code of the SC '23 paper: "DASP: Specific Dense Matrix Multiply-Accumulate Units Accelerated General Sparse Matrix-Vector Multiplication" by Yuechen Lu and Weifeng Liu.

☆29

Alternatives and similar repositories for DASP

Users that are interested in DASP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

spcl / smat
View on GitHub
Code for High Performance Unstructured SpMM Computation Using Tensor Cores
☆35Nov 3, 2024Updated last year
SuperScientificSoftwareLaboratory / TileSpMV
View on GitHub
Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang…
☆13Aug 12, 2022Updated 3 years ago
cslab-ntua / artificial-matrix-generator
View on GitHub
An artificial matrix generator in C
☆13Feb 16, 2023Updated 3 years ago
AnonymousRepo123 / AlphaSparse
View on GitHub
A intelligent matrix format designer for SpMV
☆10Oct 10, 2023Updated 2 years ago
araij / rabbit_order
View on GitHub
☆49Jan 30, 2026Updated 5 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
gevtushenko / block_matrix_format_performance
View on GitHub
☆12Jan 19, 2020Updated 6 years ago
owensgroup / merge-spmm
View on GitHub
Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018
☆73Oct 5, 2020Updated 5 years ago
LucasWilkinson / ASpT-mirror
View on GitHub
Mirror of http://gitlab.hpcrl.cse.ohio-state.edu/chong/ppopp19_ae, refactoring for understanding
☆17Oct 20, 2021Updated 4 years ago
UDC-GAC / venom
View on GitHub
A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores
☆62Nov 24, 2023Updated 2 years ago
cslab-ntua / SpMV-Research
View on GitHub
☆24Jun 12, 2026Updated last month
YukeWang96 / QGTC_PPoPP22
View on GitHub
Artifact for PPoPP22 QGTC: Accelerating Quantized GNN via GPU Tensor Core.
☆30Feb 12, 2022Updated 4 years ago
uuudown / SBNN
View on GitHub
Singular Binarized Neural Network based on GPU Bit Operations (see our SC-19 paper)
☆17Dec 9, 2020Updated 5 years ago
Xtra-Computing / G3
View on GitHub
G3: A Programmable GNN Training System on GPU
☆43Aug 29, 2020Updated 5 years ago
HPMLL / DTC-SpMM_ASPLOS24
View on GitHub
☆47Jun 19, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
CRAFT-THU / RoDe
View on GitHub
A Row Decomposition-based Approach for Sparse Matrix Multiplication on GPUs
☆30Nov 29, 2023Updated 2 years ago
weifengliu-ssslab / Benchmark_SpMV_using_CSR5
View on GitHub
CSR5-based SpMV on CPUs, GPUs and Xeon Phi
☆111Jun 10, 2024Updated 2 years ago
weifengliu-ssslab / Benchmark_SpGEMM_using_CSR
View on GitHub
CSR-based SpGEMM on nVidia and AMD GPUs
☆48Apr 9, 2016Updated 10 years ago
vtsynergy / bb_segsort
View on GitHub
☆21Aug 21, 2023Updated 2 years ago
guqiqi / Samoyeds
View on GitHub
Samoyeds: Accelerating MoE Models with Structured Sparsity Leveraging Sparse Tensor Cores (EuroSys'25)
☆16Jul 17, 2025Updated last year
monkey2000 / spv8-public
View on GitHub
SpV8 is a SpMV kernel written in AVX-512. Artifact for our SpV8 paper @ DAC '21.
☆29Mar 16, 2021Updated 5 years ago
google-research / sputnik
View on GitHub
A library of GPU kernels for sparse matrix operations.
☆289Nov 24, 2020Updated 5 years ago
concept-inversion / C-SAW
View on GitHub
A Framework for Graph Sampling and Random Walk on GPUs.
☆38Feb 3, 2025Updated last year
SuperScientificSoftwareLaboratory / PanguLU
View on GitHub
PanguLU: A Scalable Regular Two-Dimensional Block-Cyclic Sparse Direct Solver on Distributed Heterogeneous Systems
☆49Jun 25, 2026Updated 3 weeks ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
sderek / CUDAAdvisor
View on GitHub
CUDAAdvisor: a GPU profiling tool
☆53Aug 24, 2018Updated 7 years ago
chenxuhao / gardenia
View on GitHub
GARDENIA: Graph Analytics Repository for Designing Efficient Next-generation Accelerators
☆34Apr 3, 2022Updated 4 years ago
Bruce-Lee-LY / cuda_back2back_hgemm
View on GitHub
Use tensor core to calculate back-to-back HGEMM (half-precision general matrix multiplication) with MMA PTX instruction.
☆13Nov 3, 2023Updated 2 years ago
leos313 / DOOM_FPGA
View on GitHub
Accelerating a Classic 3D Video Game (The DOOM) on Heterogeneous Reconfigurable MPSoCs
☆21Jun 4, 2020Updated 6 years ago
chhzh123 / Krill
View on GitHub
An efficient concurrent graph processing system
☆46Oct 27, 2021Updated 4 years ago
hpc-ulisboa / gpuPTXModel
View on GitHub
GPU Static Modeling using PTX and Deep Structured Learning
☆19Apr 1, 2020Updated 6 years ago
poojahira / spmv-cuda
View on GitHub
Implementation and analysis of five different GPU based SPMV algorithms in CUDA
☆39Feb 5, 2019Updated 7 years ago
ParCIS / Magicube
View on GitHub
Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.
☆92Nov 23, 2022Updated 3 years ago
faldupriyank / grasp
View on GitHub
Source code for the evaluated benchmarks and proposed cache management technique, GRASP, in [Faldu et al., HPCA'20].
☆18Jan 23, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ParCIS / FlashSparse
View on GitHub
FlashSparse significantly reduces the computation redundancy for unstructured sparsity (for SpMM and SDDMM) on Tensor Cores through a Swa…
☆39Oct 5, 2025Updated 9 months ago
PASSIONLab / MaskedSpGEMM
View on GitHub
☆10Jul 4, 2022Updated 4 years ago
GPUPeople / ACSpGEMM
View on GitHub
Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU"
☆31Jul 7, 2020Updated 6 years ago
stanford-mast / Grazelle-PPoPP18
View on GitHub
Artifact for PPoPP 2018 paper "Making Pull-Based Graph Processing Performant"
☆23Apr 23, 2020Updated 6 years ago
horizon-research / imagen
View on GitHub
☆10Mar 8, 2025Updated last year
RRZE-HPC / GHOST
View on GitHub
General, Hybrid and Optimized Sparse Toolkit (Bitbucket mirror)
☆12Apr 8, 2021Updated 5 years ago
c3sr / tcu_scope
View on GitHub
☆50Jun 27, 2019Updated 7 years ago