OpenMathLib / OpenBLAS
OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.
☆6,757Updated this week
Alternatives and similar repositories for OpenBLAS
Users that are interested in OpenBLAS are comparing it to the libraries listed below
Sorting:
- LAPACK development repository☆1,634Updated this week
- BLAS-like Library Instantiation Software Framework☆2,434Updated last week
- oneAPI Deep Neural Network Library (oneDNN)☆3,788Updated this week
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆12,259Updated this week
- a language for fast, portable data-parallel computation☆6,055Updated this week
- C++ tensors with broadcasting and lazy computing☆3,513Updated 2 weeks ago
- DO NOT CHECK OUT THESE FILES FROM GITHUB UNLESS YOU KNOW WHAT YOU ARE DOING. (See below.)☆2,858Updated 3 months ago
- mlpack: a fast, header-only C++ machine learning library☆5,347Updated last week
- ☆1,868Updated last year
- ArrayFire: a general purpose GPU library.☆4,692Updated this week
- Optimized primitives for collective multi-GPU communication☆3,700Updated last week
- Open standard for machine learning interoperability☆18,915Updated this week
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl☆1,747Updated last year
- Tuned OpenCL BLAS☆1,103Updated 2 weeks ago
- THIS MIRROR IS DEPRECATED -- New url: https://gitlab.com/libeigen/eigen☆1,806Updated 3 years ago
- Source code examples from the Parallel Forall Blog☆1,284Updated 9 months ago
- A retargetable MLIR-based machine learning compiler and runtime toolkit.☆3,119Updated this week
- Low-precision matrix multiplication☆1,803Updated last year
- [ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl☆4,959Updated last year
- oneAPI Math Library (oneMath)☆672Updated last week
- Caffe: a fast open framework for deep learning.☆34,348Updated 9 months ago
- HIP: C++ Heterogeneous-Compute Interface for Portability☆3,999Updated this week
- header only, dependency-free deep learning framework in C++14☆5,921Updated 3 years ago
- Acceleration package for neural networks on multi-core CPUs☆1,687Updated 11 months ago
- Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning☆1,111Updated 5 years ago
- The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs☆1,294Updated 3 weeks ago
- a software library containing BLAS functions written in OpenCL☆853Updated 9 months ago
- Patterns and behaviors for GPU computing☆1,709Updated 2 years ago
- NumPy & SciPy for GPU☆10,195Updated last week
- Efficiently computes derivatives of NumPy code.☆7,259Updated last week