Source code of the SC '23 paper: "DASP: Specific Dense Matrix Multiply-Accumulate Units Accelerated General Sparse Matrix-Vector Multiplication" by Yuechen Lu and Weifeng Liu.
☆29Jun 18, 2024Updated last year
Alternatives and similar repositories for DASP
Users that are interested in DASP are comparing it to the libraries listed below.
- Code for High Performance Unstructured SpMM Computation Using Tensor Cores☆35Nov 3, 2024Updated last year
- Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang…☆13Aug 12, 2022Updated 3 years ago
- An intelligent matrix format designer for SpMV☆10Oct 10, 2023Updated 2 years ago
- An artificial matrix generator in C☆12Feb 16, 2023Updated 3 years ago
- ☆48Jan 30, 2026Updated 2 months ago
- ☆12Jan 19, 2020Updated 6 years ago
- Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018☆73Oct 5, 2020Updated 5 years ago
- Mirror of http://gitlab.hpcrl.cse.ohio-state.edu/chong/ppopp19_ae, refactoring for understanding☆17Oct 20, 2021Updated 4 years ago
- Parallel SpMV using CSR representation, built in CUDA☆14Jun 27, 2020Updated 5 years ago
- ☆23Feb 5, 2026Updated 2 months ago
- A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores☆60Nov 24, 2023Updated 2 years ago
- ☆32Aug 24, 2022Updated 3 years ago
- Artifact for PPoPP22 QGTC: Accelerating Quantized GNN via GPU Tensor Core.☆30Feb 12, 2022Updated 4 years ago
- ☆33Apr 2, 2025Updated last year
- Singular Binarized Neural Network based on GPU Bit Operations (see our SC-19 paper)☆17Dec 9, 2020Updated 5 years ago
- ☆46Jun 19, 2024Updated last year
- G3: A Programmable GNN Training System on GPU☆43Aug 29, 2020Updated 5 years ago
- A Row Decomposition-based Approach for Sparse Matrix Multiplication on GPUs☆30Nov 29, 2023Updated 2 years ago
- CSR5-based SpMV on CPUs, GPUs and Xeon Phi☆111Jun 10, 2024Updated last year
- ☆21Aug 21, 2023Updated 2 years ago
- SpV8 is a SpMV kernel written in AVX-512. Artifact for our SpV8 paper @ DAC '21.☆29Mar 16, 2021Updated 5 years ago
- CSR-based SpGEMM on nVidia and AMD GPUs☆48Apr 9, 2016Updated 10 years ago
- A library of GPU kernels for sparse matrix operations.☆286Nov 24, 2020Updated 5 years ago
- CUDAAdvisor: a GPU profiling tool☆53Aug 24, 2018Updated 7 years ago
- FlashSparse significantly reduces the computation redundancy for unstructured sparsity (for SpMM and SDDMM) on Tensor Cores through a Swa…☆38Oct 5, 2025Updated 6 months ago
- PanguLU: A Scalable Regular Two-Dimensional Block-Cyclic Sparse Direct Solver on Distributed Heterogeneous Systems☆46Aug 2, 2025Updated 8 months ago
- Implementation and analysis of five different GPU based SPMV algorithms in CUDA☆40Feb 5, 2019Updated 7 years ago
- A Framework for Graph Sampling and Random Walk on GPUs.☆38Feb 3, 2025Updated last year
- An efficient concurrent graph processing system☆46Oct 27, 2021Updated 4 years ago
- A new memory mapping interface for efficient direct user-space access to byte-addressable storage, published in MICRO2022.☆15Sep 29, 2022Updated 3 years ago
- Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.☆92Nov 23, 2022Updated 3 years ago
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU"☆31Jul 7, 2020Updated 5 years ago
- Source code for the evaluated benchmarks and proposed cache management technique, GRASP, in [Faldu et al., HPCA'20].☆18Jan 23, 2020Updated 6 years ago
- General, Hybrid and Optimized Sparse Toolkit (Bitbucket mirror)☆12Apr 8, 2021Updated 5 years ago
- ☆50Jun 27, 2019Updated 6 years ago
- ☆10Mar 8, 2025Updated last year
- Some "Formula Translations" for Yousef Saad's book "Iterative Methods for Sparse Linear Systems (2nd Edition)"☆13Jan 14, 2018Updated 8 years ago
- mini is mini☆20Jan 19, 2020Updated 6 years ago
- Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.☆10Sep 24, 2023Updated 2 years ago