flame / blisLinks
BLAS-like Library Instantiation Software Framework
☆2,460Updated this week
Alternatives and similar repositories for blis
Users that are interested in blis are comparing it to the libraries listed below
Sorting:
- OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.☆6,840Updated this week
- Library for specialized dense and sparse matrix operations, and deep learning primitives.☆880Updated this week
- High-performance object-based library for DLA computations☆243Updated last year
- Tuned OpenCL BLAS☆1,116Updated 2 months ago
- oneAPI Math Library (oneMath)☆690Updated last week
- A lightweight high performance tensor algebra framework for modern C++☆791Updated last year
- The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs☆1,303Updated 2 months ago
- HIP: C++ Heterogeneous-Compute Interface for Portability☆4,096Updated this week
- a software library containing BLAS functions written in OpenCL☆856Updated 10 months ago
- LAPACK development repository☆1,657Updated last week
- A retargetable MLIR-based machine learning compiler and runtime toolkit.☆3,197Updated this week
- ☆1,890Updated last year
- Patterns and behaviors for GPU computing☆1,725Updated 3 years ago
- SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT☆732Updated 2 months ago
- ArrayFire: a general purpose GPU library.☆4,729Updated this week
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl☆1,755Updated last year
- CUDA Core Compute Libraries☆1,711Updated this week
- BLISlab: A Sandbox for Optimizing GEMM☆529Updated 4 years ago
- An efficient C++17 GPU numerical computing library with Python-like syntax☆1,332Updated this week
- Numerical linear algebra software package☆477Updated this week
- A header-only C++ library for numerical optimization --☆789Updated last month
- Vulkan/CUDA/HIP/OpenCL/Level Zero/Metal Fast Fourier Transform library☆1,644Updated 2 months ago
- trying to collect all useful tutorials for famous C math and linear algebra libraries such as CBLAS, CLAPACK, GSL...☆428Updated 4 years ago
- High-performance automatic differentiation of LLVM and MLIR.☆1,413Updated this week
- C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))☆2,413Updated last week
- Assembler for NVIDIA Maxwell architecture☆1,010Updated 2 years ago
- [ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl☆4,974Updated last year
- Intel® Implicit SPMD Program Compiler☆2,689Updated last week
- C++ tensors with broadcasting and lazy computing☆3,546Updated 3 weeks ago
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆2,246Updated 3 weeks ago