This package implements four sparse linear algebra kernels: sparse matrix-vector multiplication (SpMV), sparse triangular solve (SpTRSV), sparse matrix transposition (SpTrans), and sparse matrix-matrix multiplication (SpMM), targeting single-node multi-GPU (scale-up) platforms such as NVIDIA DGX-1 and DGX-2.
☆28 · Jun 1, 2020 · Updated 5 years ago
Alternatives and similar repositories for s-blas
Users who are interested in s-blas are comparing it to the libraries listed below
- A Synchronization-Free Algorithm for Parallel Sparse Triangular Solves (SpTRSV) ☆23 · Feb 14, 2020 · Updated 6 years ago
- The SparseX sparse kernel optimization library ☆43 · Jan 16, 2019 · Updated 7 years ago
- A sparse BLAS lib supporting multiple backends ☆51 · Nov 23, 2025 · Updated 3 months ago
- Singular Binarized Neural Network based on GPU Bit Operations (see our SC-19 paper) ☆17 · Dec 9, 2020 · Updated 5 years ago
- A C++/Python library for incomplete LU factorizations based on Jan Mayer's ILU++ ☆34 · Oct 1, 2021 · Updated 4 years ago
- ☆17 · Apr 8, 2021 · Updated 4 years ago
- PaStiX (Parallel Sparse matriX package) solver library ☆20 · Nov 20, 2018 · Updated 7 years ago
- A sort wrapper enabling both use of random-access sorting on non-random access containers, and increased performance for the sorting of l … ☆21 · Jul 11, 2025 · Updated 8 months ago
- Sparse-dense matrix-matrix multiplication on GPUs ☆14 · Oct 15, 2018 · Updated 7 years ago
- SpMV using CUDA ☆20 · Mar 5, 2018 · Updated 8 years ago
- ☆98 · Feb 10, 2017 · Updated 9 years ago
- AMD optimized Sparse Linear Algebra library ☆35 · Jan 4, 2026 · Updated 2 months ago
- ☆29 · Dec 16, 2022 · Updated 3 years ago
- A Thread-Level Synchronization-Free Sparse Triangular Solve on GPUs ☆56 · Mar 19, 2021 · Updated 4 years ago
- Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite ☆69 · Sep 12, 2018 · Updated 7 years ago
- Benchmark for matrix multiplications between dense and block sparse (BSR) matrices in TVM, blocksparse (Gray et al.) and cuSparse. ☆23 · Aug 21, 2020 · Updated 5 years ago
- SpV8 is a SpMV kernel written in AVX-512. Artifact for our SpV8 paper @ DAC '21. ☆29 · Mar 16, 2021 · Updated 4 years ago
- ☆37 · Jul 25, 2022 · Updated 3 years ago
- This repository contains an implementation of sparse-dense matrix multiplication for matrices of dimension 560 x 560. ☆10 · Jul 7, 2021 · Updated 4 years ago
- Multilevel Directed Acyclic Graph Partitioner ☆35 · Apr 13, 2022 · Updated 3 years ago
- Argonne Leadership Computing Facility OpenCL tutorial ☆10 · Aug 22, 2025 · Updated 6 months ago
- Ariston Net integration with Home Assistant ☆10 · Nov 3, 2020 · Updated 5 years ago
- Sparse Matrix Factorization (SMF) is a key component in many machine learning problems and there exist a variety of applications in real-w… ☆11 · Jan 25, 2016 · Updated 10 years ago
- Code for paper: Localized matrix factorization for recommendation based on matrix block diagonal forms ☆10 · Jan 27, 2015 · Updated 11 years ago
- This repository provides netlists of different SRAM cells to be simulated with the HSpice tool in sub-20nm FinFET technologies. ☆12 · Dec 31, 2020 · Updated 5 years ago
- Example project to test COM programming in the Lazarus IDE ☆11 · Dec 1, 2014 · Updated 11 years ago
- ☆17 · Jul 18, 2022 · Updated 3 years ago
- Performance Monitor library - This library records execution performance of a user code and reports the summary. The PMlib is able to use… ☆11 · Mar 21, 2023 · Updated 2 years ago
- Pragmatic, Productive, and Portable Affinity for HPC ☆51 · Mar 8, 2026 · Updated last week
- Library of High Precision Sparse Matrix Operations Accelerated by SIMD ☆44 · Jun 18, 2021 · Updated 4 years ago
- Sympiler is a Code Generator for Transforming Sparse Matrix Codes ☆44 · Jul 12, 2023 · Updated 2 years ago
- PanguLU: A Scalable Regular Two-Dimensional Block-Cyclic Sparse Direct Solver on Distributed Heterogeneous Systems ☆45 · Aug 2, 2025 · Updated 7 months ago
- List of resources about modern dynamic polymorphism in C++. ☆12 · Sep 29, 2018 · Updated 7 years ago
- Synthesiser for Asynchronous Verilog Language ☆20 · Oct 29, 2014 · Updated 11 years ago
- Aries Network Performance Counters Monitoring Library ☆11 · Nov 19, 2020 · Updated 5 years ago
- VNEC: A Vectorized Non-Empty Column Format for SpMV on cross-platform multicore CPUs ☆10 · Feb 6, 2024 · Updated 2 years ago
- Datasets of audio adversarial examples for deep speech recognition systems and Python code of a detection system ☆13 · May 6, 2023 · Updated 2 years ago
- A memory-centric profiling tool suite for heterogeneous memory ☆11 · Nov 13, 2024 · Updated last year
- ☆15 · Apr 6, 2016 · Updated 9 years ago