pnnl/s-blas

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pnnl/s-blas)

pnnl / s-blas

This package includes the implementation for four sparse linear algebra kernels: Sparse-Matrix-Vector-Multiplication (SpMV), Sparse-Triangular-Solve (SpTRSV), Sparse-Matrix-Transposition (SpTrans) and Sparse-Matrix-Matrix-Multiplication (SpMM) for Single-node Multi-GPU (scale-up) platforms such as NVIDIA DGX-1 and DGX-2.

☆29

Alternatives and similar repositories for s-blas

Users that are interested in s-blas are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

pnnl / HiParTI
View on GitHub
☆17Apr 8, 2021Updated 5 years ago
uuudown / SBNN
View on GitHub
Singular Binarized Neural Network based on GPU Bit Operations (see our SC-19 paper)
☆17Dec 9, 2020Updated 5 years ago
weifengliu-ssslab / Benchmark_SpTRSV_using_CSC
View on GitHub
A Synchronization-Free Algorithm for Parallel Sparse Triangular Solves (SpTRSV)
☆23Feb 14, 2020Updated 6 years ago
AlphaSparse / Library
View on GitHub
A sparse BLAS lib supporting multiple backends
☆51Mar 18, 2026Updated 4 months ago
hclhkbu / gcoospdm
View on GitHub
Sparse-dense matrix-matrix multiplication on GPUs
☆14Oct 15, 2018Updated 7 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
aneesh297 / Sparse-Matrix-Vector-Multiplication
View on GitHub
SpMV using CUDA
☆20Mar 5, 2018Updated 8 years ago
c-f-h / ilupp
View on GitHub
A C++/Python library for incomplete LU factorizations based on Jan Mayer's ILU++
☆35Oct 1, 2021Updated 4 years ago
pnnl / TCBNN
View on GitHub
☆39Jul 25, 2022Updated 3 years ago
pnnl / nwqbench
View on GitHub
☆13Jul 18, 2024Updated 2 years ago
dumerrill / merge-spmv
View on GitHub
☆99Feb 10, 2017Updated 9 years ago
sstsimulator / sst-macro
View on GitHub
SST Macro Element Library
☆37May 12, 2026Updated 2 months ago
argonne-lcf / alcl
View on GitHub
Argonne Leadership Computing Facility OpenCL tutorial
☆10Aug 22, 2025Updated 10 months ago
tgmattso / GraphBLAS
View on GitHub
Materials for a GraphBLAS tutorial
☆17Oct 4, 2019Updated 6 years ago
ARM-software / HPCG_for_Arm
View on GitHub
☆30Dec 16, 2022Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
OpenCMISS-Dependencies / pastix
View on GitHub
PaStiX (Parallel Sparse matriX package) solver library
☆20Nov 20, 2018Updated 7 years ago
NVIDIA / nvbench_demo
View on GitHub
Simple starter CMake project that uses NVBench.
☆15May 6, 2025Updated last year
monkey2000 / spv8-public
View on GitHub
SpV8 is a SpMV kernel written in AVX-512. Artifact for our SpV8 paper @ DAC '21.
☆29Mar 16, 2021Updated 5 years ago
pnnl / arena
View on GitHub
The programming runtime and interfaces for ARENA.
☆14Sep 14, 2021Updated 4 years ago
doctorvanmartin / homeassistant-ariston-sensor
View on GitHub
Ariston Net integration with home assistant
☆10Nov 3, 2020Updated 5 years ago
SuperScientificSoftwareLaboratory / PanguLU
View on GitHub
PanguLU: A Scalable Regular Two-Dimensional Block-Cyclic Sparse Direct Solver on Distributed Heterogeneous Systems
☆49Jun 25, 2026Updated 3 weeks ago
MatanHamilis / one_stencil
View on GitHub
Multiple 1-stencil implementations using nvidia cuda.
☆12Dec 2, 2017Updated 8 years ago
avr-aics-riken / PMlib
View on GitHub
Performance Monitor library - This library records execution performance of a user code and reports the summary. The PMlib is able to use…
☆11Mar 21, 2023Updated 3 years ago
kunpengcompute / Kunpeng
View on GitHub
Welcome to the Kunpeng compute community. Find the right repo where you want to create an issue.
☆73Feb 14, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
amd / aocl-sparse
View on GitHub
AMD optimized Sparse Linear Algebra library
☆36Updated this week
pnnl / qasmtrans
View on GitHub
A C++ based quantum transpiler for NISQ devices
☆31Dec 3, 2025Updated 7 months ago
SuperScientificSoftwareLaboratory / TileSpMV
View on GitHub
Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang…
☆13Aug 12, 2022Updated 3 years ago
ROCm / rocSPARSE
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-libraries repo
☆135Jul 9, 2026Updated last week
HPAC / ReLAPACK
View on GitHub
Recursive LAPACK Collection
☆44Feb 20, 2022Updated 4 years ago
c3sr / tcu_scope
View on GitHub
☆50Jun 27, 2019Updated 7 years ago
luuhwy / VNEC
View on GitHub
VNEC: A Vectorized Non-Empty Column Format for SpMV on cross-platform multicore CPUs
☆10Feb 6, 2024Updated 2 years ago
uuudown / QASMBench
View on GitHub
QASMBench is an OpenQASM benchmark suite running on IBM Quantum-Experience backends.
☆26Jun 10, 2021Updated 5 years ago
KTH-ScaLab / nmo
View on GitHub
A memory-centric profiling tool suite for heterogeneous memory
☆10Nov 13, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
owensgroup / merge-spmm
View on GitHub
Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018
☆73Oct 5, 2020Updated 5 years ago
josiahwsmith10 / Introduction-to-MIMO-FMCW-Radar
View on GitHub
Introduction to MIMO-FMCW Radar with MATLAB Examples
☆11Aug 25, 2023Updated 2 years ago
solomy / Macro-for-PUBG
View on GitHub
绝地求生鼠标宏 by AHK
☆10Feb 20, 2018Updated 8 years ago
eyalroz / gpu-kernel-runner
View on GitHub
Runs a single CUDA/OpenCL kernel, taking its source from a file and arguments from the command-line
☆26Jun 10, 2026Updated last month
pnnl / nwqec
View on GitHub
NWQEC: A toolkit for fault-tolerant quantum circuit transpilation and T-count optimization.
☆21Updated this week
JimZeyuYang / GPU_Power_Benchmark
View on GitHub
Microbenchmark that unveals the mechanisms behind power readings reported by nvidia-smi on your NVIDIA GPU.
☆15Dec 12, 2024Updated last year
shaayaansayed / py-ego
View on GitHub
Efficient Global Optimization
☆10Feb 26, 2016Updated 10 years ago