flame / blisLinks
BLAS-like Library Instantiation Software Framework
☆2,600Updated 2 months ago
Alternatives and similar repositories for blis
Users that are interested in blis are comparing it to the libraries listed below
Sorting:
- Library for specialized dense and sparse matrix operations, and deep learning primitives.☆933Updated 3 weeks ago
- The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs☆1,348Updated 9 months ago
- Tuned OpenCL BLAS☆1,164Updated this week
- ☆1,984Updated 2 years ago
- Patterns and behaviors for GPU computing☆1,764Updated 3 weeks ago
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl☆1,817Updated 2 years ago
- An efficient C++20 GPU numerical computing library with Python-like syntax☆1,402Updated this week
- oneAPI Math Library (oneMath)☆740Updated 3 weeks ago
- LAPACK development repository☆1,795Updated 2 weeks ago
- OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.☆7,253Updated last week
- SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT☆804Updated last month
- CUDA Core Compute Libraries☆2,162Updated this week
- pocl - Portable Computing Language☆1,048Updated this week
- ArrayFire: a general purpose GPU library.☆4,854Updated 5 months ago
- trying to collect all useful tutorials for famous C math and linear algebra libraries such as CBLAS, CLAPACK, GSL...☆438Updated last week
- [ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl☆4,998Updated last year
- The official SuiteSparse library: a suite of sparse matrix algorithms authored or co-authored by Tim Davis, Texas A&M University.☆1,443Updated this week
- [ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl☆2,310Updated 2 years ago
- A lightweight high performance tensor algebra framework for modern C++☆830Updated 6 months ago
- Low-precision matrix multiplication☆1,832Updated 2 years ago
- High-performance automatic differentiation of LLVM and MLIR.☆1,532Updated this week
- Source code examples from the Parallel Forall Blog☆1,322Updated 4 months ago
- a software library containing BLAS functions written in OpenCL☆863Updated last year
- CUDA Library Samples☆2,306Updated 2 weeks ago
- A header-only C++ library for numerical optimization --☆805Updated last month
- Assembler for NVIDIA Maxwell architecture☆1,060Updated 3 years ago
- Vector class library, latest version☆1,431Updated 2 years ago
- Programmable CUDA/C++ GPU Graph Analytics☆1,063Updated last week
- Intel® Implicit SPMD Program Compiler☆2,839Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction☆2,435Updated last week