A sparse BLAS lib supporting multiple backends
☆51Mar 18, 2026Updated 2 months ago
Alternatives and similar repositories for Library
Users that are interested in Library are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A pattern-based algorithmic autotuner for graph processing on GPUs.☆33Jun 25, 2025Updated 10 months ago
- A Synchronization-Free Algorithm for Parallel Sparse Triangular Solves (SpTRSV)☆23Feb 14, 2020Updated 6 years ago
- ☆14Apr 24, 2024Updated 2 years ago
- Fast Synchronization-Free Algorithms for Parallel Sparse Triangular Solves with Multiple Right-Hand Sides (SpTRSM)☆17Feb 14, 2020Updated 6 years ago
- SpV8 is a SpMV kernel written in AVX-512. Artifact for our SpV8 paper @ DAC '21.☆29Mar 16, 2021Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This package includes the implementation for four sparse linear algebra kernels: Sparse-Matrix-Vector-Multiplication (SpMV), Sparse-Trian…☆29Jun 1, 2020Updated 5 years ago
- CSR-based SpMV on Heterogeneous Processors (Intel Broadwell, AMD Kaveri and nVidia Tegra K1)☆26May 12, 2015Updated 11 years ago
- best CPU/GPU sparse solver for large sparse matrices☆21Oct 5, 2021Updated 4 years ago
- Efficient SpGEMM on GPU using CUDA and CSR☆61Jul 18, 2023Updated 2 years ago
- CSR5-based SpMV on CPUs, GPUs and Xeon Phi☆111Jun 10, 2024Updated last year
- Sympiler is a Code Generator for Transforming Sparse Matrix Codes☆44Jul 12, 2023Updated 2 years ago
- Implementation of TSM2L and TSM2R -- High-Performance Tall-and-Skinny Matrix-Matrix Multiplication Algorithms for CUDA☆35Jul 28, 2020Updated 5 years ago
- A C++/Python library for incomplete LU factorizations based on Jan Mayer's ILU++☆34Oct 1, 2021Updated 4 years ago
- This repository contains the official PyTorch implementation of MatRIS.☆38Nov 7, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆135Updated this week
- Sparse symmetric indefinite solver implemented with a runtime system☆13May 11, 2020Updated 6 years ago
- New batched algorithm for sparse matrix-matrix multiplication (SpMM)☆16May 7, 2019Updated 7 years ago
- A intelligent matrix format designer for SpMV☆10Oct 10, 2023Updated 2 years ago
- GenDP: A Dynamic Programming Framework for Genome Sequencing Analysis☆17Jan 12, 2024Updated 2 years ago
- A library of GPU kernels for sparse matrix operations.☆288Nov 24, 2020Updated 5 years ago
- SQL Optimizations using MLIR☆12Apr 5, 2020Updated 6 years ago
- A framework for pipelined computing on GPU☆30Jul 17, 2019Updated 6 years ago
- ☆15Apr 15, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆99Feb 10, 2017Updated 9 years ago
- ☆17Jul 1, 2020Updated 5 years ago
- A Thread-Level Synchronization-Free Sparse Triangular Solve on GPUs☆56Mar 19, 2021Updated 5 years ago
- PaStiX (Parallel Sparse matriX package) solver library☆20Nov 20, 2018Updated 7 years ago
- Artifact for "DX100: A Programmable Data Access Accelerator for Indirection (ISCA 2025)" paper☆18Nov 6, 2025Updated 6 months ago
- A GPU algorithm for sparse matrix-matrix multiplication☆74Oct 1, 2020Updated 5 years ago
- Instructions and templates for SC authors☆17Aug 22, 2021Updated 4 years ago
- Numerical linear algebra software package☆597Updated this week
- ☆17May 12, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- An Attention Superoptimizer☆22Jan 20, 2025Updated last year
- Darwin: A co-processor for long read alignment☆16Apr 5, 2019Updated 7 years ago
- ☆41Apr 3, 2022Updated 4 years ago
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆259Jan 13, 2025Updated last year
- development repository for the open earth compiler☆82Feb 19, 2021Updated 5 years ago
- PSTensor provides a way to hack the memory management of tensors in TensorFlow and PyTorch by defining your own C++ Tensor Class.☆10Feb 10, 2022Updated 4 years ago
- The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the inte…☆27Jun 6, 2025Updated 11 months ago