flame / blis
BLAS-like Library Instantiation Software Framework
☆2,571 Updated last month
Alternatives and similar repositories for blis
Users that are interested in blis are comparing it to the libraries listed below
- The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs ☆1,338 Updated 8 months ago
- Library for specialized dense and sparse matrix operations, and deep learning primitives. ☆927 Updated last week
- LAPACK development repository ☆1,766 Updated last week
- trying to collect all useful tutorials for famous C math and linear algebra libraries such as CBLAS, CLAPACK, GSL... ☆437 Updated 4 years ago
- SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT ☆793 Updated this week
- Tuned OpenCL BLAS ☆1,162 Updated 3 weeks ago
- Patterns and behaviors for GPU computing ☆1,754 Updated 3 years ago
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl ☆1,809 Updated 2 years ago
- Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction ☆2,404 Updated this week
- ☆1,968 Updated 2 years ago
- ArrayFire: a general purpose GPU library. ☆4,842 Updated 3 months ago
- Assembler for NVIDIA Maxwell architecture ☆1,058 Updated 2 years ago
- OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version. ☆7,181 Updated this week
- A lightweight high performance tensor algebra framework for modern C++ ☆826 Updated 5 months ago
- automatic differentiation made easier for C++ ☆1,889 Updated 10 months ago
- The official SuiteSparse library: a suite of sparse matrix algorithms authored or co-authored by Tim Davis, Texas A&M University. ☆1,412 Updated last week
- High-performance automatic differentiation of LLVM and MLIR. ☆1,513 Updated last week
- oneAPI Math Library (oneMath) ☆735 Updated 2 weeks ago
- Numerical linear algebra software package ☆539 Updated last week
- An efficient C++20 GPU numerical computing library with Python-like syntax ☆1,373 Updated last week
- CUDA Core Compute Libraries ☆2,087 Updated this week
- Source code examples from the Parallel Forall Blog ☆1,312 Updated 3 months ago
- Intel® Implicit SPMD Program Compiler ☆2,811 Updated this week
- BLISlab: A Sandbox for Optimizing GEMM ☆552 Updated 4 years ago
- Programmable CUDA/C++ GPU Graph Analytics ☆1,049 Updated last year
- C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE, WebAssembly, VSX, RISC-… ☆2,565 Updated 3 weeks ago
- a software library containing BLAS functions written in OpenCL ☆862 Updated last year
- Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, … ☆1,758 Updated this week
- The Legion Parallel Programming System ☆748 Updated last week
- HIP: C++ Heterogeneous-Compute Interface for Portability ☆4,290 Updated last week