A sparse BLAS lib supporting multiple backends
☆51Mar 18, 2026Updated last month
Alternatives and similar repositories for Library
Users that are interested in Library are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A pattern-based algorithmic autotuner for graph processing on GPUs.☆32Jun 25, 2025Updated 10 months ago
- A Synchronization-Free Algorithm for Parallel Sparse Triangular Solves (SpTRSV)☆23Feb 14, 2020Updated 6 years ago
- ☆14Apr 24, 2024Updated 2 years ago
- Fast Synchronization-Free Algorithms for Parallel Sparse Triangular Solves with Multiple Right-Hand Sides (SpTRSM)☆17Feb 14, 2020Updated 6 years ago
- SpV8 is a SpMV kernel written in AVX-512. Artifact for our SpV8 paper @ DAC '21.☆29Mar 16, 2021Updated 5 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- This package includes the implementation for four sparse linear algebra kernels: Sparse-Matrix-Vector-Multiplication (SpMV), Sparse-Trian…☆29Jun 1, 2020Updated 5 years ago
- CSR-based SpMV on Heterogeneous Processors (Intel Broadwell, AMD Kaveri and nVidia Tegra K1)☆26May 12, 2015Updated 10 years ago
- best CPU/GPU sparse solver for large sparse matrices☆21Oct 5, 2021Updated 4 years ago
- Efficient SpGEMM on GPU using CUDA and CSR☆60Jul 18, 2023Updated 2 years ago
- CSR5-based SpMV on CPUs, GPUs and Xeon Phi☆111Jun 10, 2024Updated last year
- Sympiler is a Code Generator for Transforming Sparse Matrix Codes☆44Jul 12, 2023Updated 2 years ago
- Implementation of TSM2L and TSM2R -- High-Performance Tall-and-Skinny Matrix-Matrix Multiplication Algorithms for CUDA☆35Jul 28, 2020Updated 5 years ago
- A C++/Python library for incomplete LU factorizations based on Jan Mayer's ILU++☆34Oct 1, 2021Updated 4 years ago
- This repository contains the official PyTorch implementation of MatRIS.☆37Nov 7, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆135Apr 22, 2026Updated last week
- Sparse symmetric indefinite solver implemented with a runtime system☆13May 11, 2020Updated 5 years ago
- New batched algorithm for sparse matrix-matrix multiplication (SpMM)☆16May 7, 2019Updated 6 years ago
- An artificial matrix generator in C☆12Feb 16, 2023Updated 3 years ago
- A intelligent matrix format designer for SpMV☆10Oct 10, 2023Updated 2 years ago
- GenDP: A Dynamic Programming Framework for Genome Sequencing Analysis☆17Jan 12, 2024Updated 2 years ago
- A library of GPU kernels for sparse matrix operations.☆286Nov 24, 2020Updated 5 years ago
- SQL Optimizations using MLIR☆12Apr 5, 2020Updated 6 years ago
- A framework for pipelined computing on GPU☆30Jul 17, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆15Apr 15, 2022Updated 4 years ago
- ☆99Feb 10, 2017Updated 9 years ago
- ☆17Jul 1, 2020Updated 5 years ago
- A Thread-Level Synchronization-Free Sparse Triangular Solve on GPUs☆56Mar 19, 2021Updated 5 years ago
- PaStiX (Parallel Sparse matriX package) solver library☆20Nov 20, 2018Updated 7 years ago
- Artifact for "DX100: A Programmable Data Access Accelerator for Indirection (ISCA 2025)" paper☆17Nov 6, 2025Updated 5 months ago
- A New Format for SIMD-accelerated SpMV☆22Apr 4, 2022Updated 4 years ago
- A GPU algorithm for sparse matrix-matrix multiplication☆75Oct 1, 2020Updated 5 years ago
- Instructions and templates for SC authors☆17Aug 22, 2021Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.☆92Nov 23, 2022Updated 3 years ago
- ☆17May 12, 2025Updated 11 months ago
- Darwin: A co-processor for long read alignment☆16Apr 5, 2019Updated 7 years ago
- ☆41Apr 3, 2022Updated 4 years ago
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆259Jan 13, 2025Updated last year
- development repository for the open earth compiler☆82Feb 19, 2021Updated 5 years ago
- CUDA templates for tile-sparse matrix multiplication based on CUTLASS.☆50Mar 1, 2018Updated 8 years ago