pnnl / s-blasView external linksLinks
This package includes the implementation for four sparse linear algebra kernels: Sparse-Matrix-Vector-Multiplication (SpMV), Sparse-Triangular-Solve (SpTRSV), Sparse-Matrix-Transposition (SpTrans) and Sparse-Matrix-Matrix-Multiplication (SpMM) for Single-node Multi-GPU (scale-up) platforms such as NVIDIA DGX-1 and DGX-2.
☆28Jun 1, 2020Updated 5 years ago
Alternatives and similar repositories for s-blas
Users that are interested in s-blas are comparing it to the libraries listed below
Sorting:
- A Synchronization-Free Algorithm for Parallel Sparse Triangular Solves (SpTRSV)☆22Feb 14, 2020Updated 6 years ago
- The SparseX sparse kernel optimization library☆43Jan 16, 2019Updated 7 years ago
- A sparse BLAS lib supporting multiple backends☆49Nov 23, 2025Updated 2 months ago
- Singular Binarized Neural Network based on GPU Bit Operations (see our SC-19 paper)☆16Dec 9, 2020Updated 5 years ago
- A C++/Python library for incomplete LU factorizations based on Jan Mayer's ILU++☆33Oct 1, 2021Updated 4 years ago
- ☆17Apr 8, 2021Updated 4 years ago
- PaStiX (Parallel Sparse matriX package) solver library☆20Nov 20, 2018Updated 7 years ago
- Sparse-dense matrix-matrix multiplication on GPUs☆14Oct 15, 2018Updated 7 years ago
- ☆98Feb 10, 2017Updated 9 years ago
- AMD optimized Sparse Linear Algebra library☆35Jan 4, 2026Updated last month
- A Thread-Level Synchronization-Free Sparse Triangular Solve on GPUs☆56Mar 19, 2021Updated 4 years ago
- autonomous driving contest reference kit☆10Dec 2, 2021Updated 4 years ago
- SpV8 is a SpMV kernel written in AVX-512. Artifact for our SpV8 paper @ DAC '21.☆29Mar 16, 2021Updated 4 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆134Jan 21, 2026Updated 3 weeks ago
- Multilevel Directed Acyclic Graph Partitioner☆35Apr 13, 2022Updated 3 years ago
- Pragmatic, Productive, and Portable Affinity for HPC☆51Jan 15, 2026Updated 3 weeks ago
- C#/.NET bindings for DSS C-API, an unofficial implementation of OpenDSS with a custom API in plain C, new features and API extensions.☆10Mar 15, 2024Updated last year
- This place provide different SRAM cells netlist to be simulated with HSpice tool in sub-20nm FinFET technologies.☆12Dec 31, 2020Updated 5 years ago
- Code for paper: Localized matrix factorization for recommendation based on matrix block diagonal forms☆10Jan 27, 2015Updated 11 years ago
- example project to test COM programming in the Lazarus IDE☆10Dec 1, 2014Updated 11 years ago
- Fast Sparse Multifrontal Solver☆11May 27, 2015Updated 10 years ago
- Library of High Precision Sparse Matrix Operations Accelerated by SIMD☆44Jun 18, 2021Updated 4 years ago
- Implementation and analysis of five different GPU based SPMV algorithms in CUDA☆40Feb 5, 2019Updated 7 years ago
- PanguLU: A Scalable Regular Two-Dimensional Block-Cyclic Sparse Direct Solver on Distributed Heterogeneous Systems☆45Aug 2, 2025Updated 6 months ago
- Sparse matrix and vector classes, solvers. This is a mirror repository - development happens on https://gitlab.dune-project.org/☆11Feb 6, 2026Updated last week
- Artifact of paper "Exploiting Recent SIMD Architectural Advances for Irregular Applications"☆11Jun 23, 2016Updated 9 years ago
- An experimental lexer and parser generator☆10Jul 31, 2018Updated 7 years ago
- ☆11Jan 21, 2026Updated 3 weeks ago
- FastImp is a wideband impedance extraction program for 3D geometries☆13Nov 6, 2017Updated 8 years ago
- ☆10Jun 28, 2019Updated 6 years ago
- ☆15Apr 6, 2016Updated 9 years ago
- Parallel high performance C++ containers (set and map)☆16Feb 25, 2024Updated last year
- Software☆10Dec 5, 2024Updated last year
- Memory Compiler Tutorial☆14Oct 7, 2020Updated 5 years ago
- A low-overhead, task-based threading API using a thread-pool of C++11 threads☆11Oct 29, 2018Updated 7 years ago
- Python Verilog-AMS Parser☆12Oct 13, 2015Updated 10 years ago
- Verilog-A implementation of MOSFET model BSIM4.8☆15Oct 4, 2019Updated 6 years ago
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.☆11May 6, 2023Updated 2 years ago
- Examples from the Openlane repository, adapted as Fusesoc cores☆12May 18, 2021Updated 4 years ago