OpenMathLib / OpenBLAS
OpenBLAS is an optimized BLAS library based on the BSD-licensed version of GotoBLAS2 1.13.
☆6,950 Updated this week
Alternatives and similar repositories for OpenBLAS
Users who are interested in OpenBLAS are comparing it to the libraries listed below.
- LAPACK development repository ☆1,699 Updated last week
- oneAPI Deep Neural Network Library (oneDNN) ☆3,868 Updated this week
- BLAS-like Library Instantiation Software Framework ☆2,493 Updated last week
- DO NOT CHECK OUT THESE FILES FROM GITHUB UNLESS YOU KNOW WHAT YOU ARE DOING. (See below.) ☆2,933 Updated 6 months ago
- ArrayFire: a general purpose GPU library. ☆4,768 Updated last month
- [ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl ☆4,987 Updated last year
- a language for fast, portable data-parallel computation ☆6,163 Updated this week
- ☆1,912 Updated 2 years ago
- The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologi… ☆3,033 Updated this week
- THIS MIRROR IS DEPRECATED -- New url: https://gitlab.com/libeigen/eigen ☆1,812 Updated 3 years ago
- Tuned OpenCL BLAS ☆1,133 Updated last week
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl ☆1,778 Updated last year
- Samples for CUDA Developers which demonstrates features in CUDA Toolkit ☆7,988 Updated 3 weeks ago
- C++ tensors with broadcasting and lazy computing ☆3,597 Updated last month
- CUDA Core Compute Libraries ☆1,880 Updated this week
- Open deep learning compiler stack for cpu, gpu and specialized accelerators ☆12,554 Updated this week
- Optimized primitives for collective multi-GPU communication ☆3,980 Updated 2 weeks ago
- Source code examples from the Parallel Forall Blog ☆1,301 Updated last year
- library to read/write .npy and .npz files in C/C++ ☆1,410 Updated 2 years ago
- HIP: C++ Heterogeneous-Compute Interface for Portability ☆4,159 Updated this week
- The official SuiteSparse library: a suite of sparse matrix algorithms authored or co-authored by Tim Davis, Texas A&M University. ☆1,344 Updated last week
- oneAPI Threading Building Blocks (oneTBB) ☆6,308 Updated this week
- Acceleration package for neural networks on multi-core CPUs ☆1,696 Updated last year
- CUDA integration for Python, plus shiny features ☆1,980 Updated 2 months ago
- CUDA Library Samples ☆2,069 Updated last week
- GNU Scientific Library with CMake build support and AMPL bindings ☆597 Updated 5 months ago
- The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs ☆1,318 Updated 4 months ago
- Open standard for machine learning interoperability ☆19,472 Updated last week
- Low-precision matrix multiplication ☆1,813 Updated last year
- mlpack: a fast, header-only C++ machine learning library ☆5,460 Updated this week