OpenMathLib / OpenBLASLinks
OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.
☆7,154Updated last week
Alternatives and similar repositories for OpenBLAS
Users that are interested in OpenBLAS are comparing it to the libraries listed below
Sorting:
- LAPACK development repository☆1,754Updated last week
- oneAPI Deep Neural Network Library (oneDNN)☆3,933Updated this week
- BLAS-like Library Instantiation Software Framework☆2,567Updated 3 weeks ago
- ArrayFire: a general purpose GPU library.☆4,837Updated 3 months ago
- The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologi…☆3,073Updated 2 weeks ago
- C++ tensors with broadcasting and lazy computing☆3,662Updated last week
- ☆1,960Updated 2 years ago
- [ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl☆4,989Updated last year
- DO NOT CHECK OUT THESE FILES FROM GITHUB UNLESS YOU KNOW WHAT YOU ARE DOING. (See below.)☆2,980Updated last month
- oneAPI Threading Building Blocks (oneTBB)☆6,458Updated this week
- Open Machine Learning Compiler Framework☆12,887Updated this week
- THIS MIRROR IS DEPRECATED -- New url: https://gitlab.com/libeigen/eigen☆1,822Updated 3 years ago
- mlpack: a fast, header-only C++ machine learning library☆5,558Updated last week
- Tuned OpenCL BLAS☆1,163Updated last week
- HIP: C++ Heterogeneous-Compute Interface for Portability☆4,274Updated this week
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli…☆20,829Updated 2 years ago
- C++ image processing and machine learning library with using of SIMD: SSE, AVX, AVX-512, AMX for x86/x64, NEON for ARM.☆2,218Updated this week
- The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs☆1,336Updated 7 months ago
- Low-precision matrix multiplication☆1,819Updated last year
- A retargetable MLIR-based machine learning compiler and runtime toolkit.☆3,488Updated this week
- NumPy aware dynamic Python compiler using LLVM☆10,769Updated last week
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl☆1,808Updated 2 years ago
- a language for fast, portable data-parallel computation☆6,437Updated this week
- High-efficiency floating-point neural network inference operators for mobile, server, and Web☆2,187Updated this week
- The official SuiteSparse library: a suite of sparse matrix algorithms authored or co-authored by Tim Davis, Texas A&M University.☆1,401Updated 2 weeks ago
- Super-project for modularized Boost☆8,186Updated this week
- library to read/write .npy and .npz files in C/C++☆1,441Updated 2 years ago
- Optimized primitives for collective multi-GPU communication☆4,289Updated last week
- CUDA Templates and Python DSLs for High-Performance Linear Algebra☆8,902Updated this week
- Compiler for Neural Network hardware accelerators☆3,323Updated last year