Source code of the paper "OpSparse: a Highly Optimized Framework for Sparse General Matrix Multiplication on GPUs"
☆15Aug 23, 2022Updated 3 years ago
Alternatives and similar repositories for OpSparse
Users that are interested in OpSparse are comparing it to the libraries listed below
Sorting:
- Efficient SpGEMM on GPU using CUDA and CSR☆59Jul 18, 2023Updated 2 years ago
- The simulator for SPADA, an SpGEMM accelerator with adaptive dataflow☆47Jan 26, 2023Updated 3 years ago
- 稀疏矩阵-向量乘的并行优化算法(OpenMP,AVX)☆11Jul 7, 2021Updated 4 years ago
- Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.☆91Nov 23, 2022Updated 3 years ago
- Classes in C++ for building applications☆14Feb 16, 2026Updated last week
- ☆11Apr 16, 2023Updated 2 years ago
- ☆23Dec 30, 2025Updated 2 months ago
- ☆10Mar 2, 2024Updated last year
- 软件工程与计算II☆11Dec 29, 2020Updated 5 years ago
- Source code of the PPoPP '22 paper: "TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs" by Y…☆46May 22, 2024Updated last year
- ☆11Sep 30, 2023Updated 2 years ago
- Simple library for manipulating strings using OpenFST☆12Sep 26, 2021Updated 4 years ago
- Instruction Pointer Classifier and Dynamic Degree Stream based Hardware Cache Prefetching☆16Nov 16, 2019Updated 6 years ago
- Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang…☆12Aug 12, 2022Updated 3 years ago
- Source code of "FlowWalker: A Memory-efficient and High-performance GPU-based Dynamic Graph Random Walk Framework"☆11Oct 23, 2024Updated last year
- A High-Throughput Multi-GPU System for Graph-Based Approximate Nearest Neighbor Search☆21Jul 22, 2025Updated 7 months ago
- ☆14Apr 24, 2024Updated last year
- ☆18Mar 4, 2025Updated 11 months ago
- High Performance Sorting Based Distributed memory K-mer counter☆15Dec 8, 2025Updated 2 months ago
- GenDP: A Dynamic Programming Framework for Genome Sequencing Analysis☆17Jan 12, 2024Updated 2 years ago
- ☆13Oct 25, 2024Updated last year
- ☆15Apr 21, 2025Updated 10 months ago
- Standardized higher-order datasets with corresponding datasheets☆18Aug 17, 2025Updated 6 months ago
- ☆13Sep 11, 2020Updated 5 years ago
- [ISCA'25] LIA: A Single-GPU LLM Inference Acceleration with Cooperative AMX-Enabled CPU-GPU Computation and CXL Offloading☆22Jan 6, 2026Updated last month
- Ultra fast MSD radix sorter☆10Jun 23, 2020Updated 5 years ago
- Singular Binarized Neural Network based on GPU Bit Operations (see our SC-19 paper)☆16Dec 9, 2020Updated 5 years ago
- Scalable radix top-k selection on GPUs.☆21Jan 27, 2025Updated last year
- ☆31Oct 21, 2025Updated 4 months ago
- ☆20Aug 21, 2023Updated 2 years ago
- Parallel SpMV using CSR representation, built in CUDA☆14Jun 27, 2020Updated 5 years ago
- Sparse matrix computation library for GPU☆59Jul 12, 2020Updated 5 years ago
- ☆14Jan 12, 2022Updated 4 years ago
- ☆22Oct 22, 2021Updated 4 years ago
- GPU based Compressed Graph Traversal☆16Jan 9, 2026Updated last month
- CUDA-DClust+: Fast DBSCAN algorithm implemented on CUDA. Based on the research paper.☆17May 9, 2025Updated 9 months ago
- SV-Sim: Scalable PGAS-based State Vector Simulation of Quantum Circuits☆20Feb 2, 2024Updated 2 years ago
- Spack package repository maintained by Student Cluster Competition Team @ Sun Yat-sen University.☆16Aug 20, 2025Updated 6 months ago
- C++ package to store Matrix Market (.mtx) file format sparse matrices in Compressed Row Storage (CSR) format.☆16Oct 16, 2019Updated 6 years ago