flame / blis
BLAS-like Library Instantiation Software Framework
☆2,511 Updated last week
Alternatives and similar repositories for blis
Users who are interested in blis are comparing it to the libraries listed below.
- Library for specialized dense and sparse matrix operations, and deep learning primitives. ☆900 Updated last week
- OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version. ☆6,978 Updated last week
- The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs ☆1,323 Updated 5 months ago
- Tuned OpenCL BLAS ☆1,142 Updated this week
- LAPACK development repository ☆1,712 Updated this week
- ☆1,919 Updated 2 years ago
- oneAPI Math Library (oneMath) ☆714 Updated last month
- Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, … ☆1,697 Updated last week
- trying to collect all useful tutorials for famous C math and linear algebra libraries such as CBLAS, CLAPACK, GSL... ☆435 Updated 4 years ago
- ArrayFire: a general purpose GPU library. ☆4,781 Updated 2 weeks ago
- Patterns and behaviors for GPU computing ☆1,738 Updated 3 years ago
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl ☆1,786 Updated last year
- CUDA Core Compute Libraries ☆1,913 Updated this week
- An efficient C++17 GPU numerical computing library with Python-like syntax ☆1,351 Updated last week
- The official SuiteSparse library: a suite of sparse matrix algorithms authored or co-authored by Tim Davis, Texas A&M University. ☆1,360 Updated last week
- SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT ☆756 Updated 2 weeks ago
- Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction ☆2,323 Updated this week
- a software library containing BLAS functions written in OpenCL ☆861 Updated last year
- A lightweight high performance tensor algebra framework for modern C++ ☆803 Updated 2 months ago
- pocl - Portable Computing Language ☆1,024 Updated this week
- Assembler for NVIDIA Maxwell architecture ☆1,024 Updated 2 years ago
- automatic differentiation made easier for C++ ☆1,843 Updated 7 months ago
- Source code examples from the Parallel Forall Blog ☆1,302 Updated last year
- C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE) ☆2,484 Updated last week
- [ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl ☆4,986 Updated last year
- High-performance automatic differentiation of LLVM and MLIR. ☆1,464 Updated this week
- Numerical linear algebra software package ☆508 Updated this week
- Intel® Implicit SPMD Program Compiler ☆2,750 Updated this week
- BLISlab: A Sandbox for Optimizing GEMM ☆538 Updated 4 years ago
- High-performance object-based library for DLA computations ☆247 Updated last year