OpenMathLib / OpenBLAS

OpenBLAS is an optimized BLAS library based on the GotoBLAS2 1.13 BSD version.

☆6,891

Alternatives and similar repositories for OpenBLAS

Users who are interested in OpenBLAS compare it to the libraries listed below.


Reference-LAPACK / lapack
LAPACK development repository
☆1,688 Updated last week
uxlfoundation / oneDNN
oneAPI Deep Neural Network Library (oneDNN)
☆3,859 Updated this week
flame / blis
BLAS-like Library Instantiation Software Framework
☆2,479 Updated 2 weeks ago
halide / Halide
a language for fast, portable data-parallel computation
☆6,143 Updated this week
arrayfire / arrayfire
ArrayFire: a general purpose GPU library.
☆4,747 Updated last week
apache / tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
☆12,510 Updated this week
ARM-software / ComputeLibrary
The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologi…
☆3,022 Updated 3 weeks ago
uxlfoundation / oneTBB
oneAPI Threading Building Blocks (oneTBB)
☆6,255 Updated last week
FFTW / fftw3
DO NOT CHECK OUT THESE FILES FROM GITHUB UNLESS YOU KNOW WHAT YOU ARE DOING. (See below.)
☆2,918 Updated 6 months ago
flame / how-to-optimize-gemm
☆1,903 Updated 2 years ago
Maratyszcza / NNPACK
Acceleration package for neural networks on multi-core CPUs
☆1,692 Updated last year
mlpack / mlpack
mlpack: a fast, header-only C++ machine learning library
☆5,430 Updated this week
xtensor-stack / xtensor
C++ tensors with broadcasting and lazy computing
☆3,575 Updated 3 weeks ago
NVIDIA / thrust
[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl
☆4,982 Updated last year
gperftools / gperftools
Main gperftools repository
☆8,781 Updated this week
CNugteren / CLBlast
Tuned OpenCL BLAS
☆1,126 Updated last month
NVIDIA / nccl
Optimized primitives for collective multi-GPU communication
☆3,923 Updated 2 weeks ago
pytorch / glow
Compiler for Neural Network hardware accelerators
☆3,309 Updated last year
tensor-compiler / taco
The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs
☆1,315 Updated 3 months ago
flann-lib / flann
Fast Library for Approximate Nearest Neighbors
☆2,334 Updated last year
ermig1979 / Simd
C++ image processing and machine learning library using SIMD: SSE, AVX, AVX-512, AMX for x86/x64, NEON for ARM.
☆2,185 Updated this week
davisking / dlib
A toolkit for making real world machine learning and data analysis applications in C++
☆14,090 Updated this week
tiny-dnn / tiny-dnn
header only, dependency-free deep learning framework in C++14
☆5,966 Updated 3 years ago
NVIDIA / cub
[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl
☆1,768 Updated last year
rogersce / cnpy
library to read/write .npy and .npz files in C/C++
☆1,407 Updated 2 years ago
eigenteam / eigen-git-mirror
THIS MIRROR IS DEPRECATED -- New url: https://gitlab.com/libeigen/eigen
☆1,814 Updated 3 years ago
apache / mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli…
☆20,815 Updated last year
google / gemmlowp
Low-precision matrix multiplication
☆1,811 Updated last year
openxla / xla
A machine learning compiler for GPUs, CPUs, and ML accelerators
☆3,399 Updated this week
xtensor-stack / xsimd
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE)
☆2,458 Updated this week