OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.
☆7,327Mar 12, 2026Updated this week
Alternatives and similar repositories for OpenBLAS
Users that are interested in OpenBLAS are comparing it to the libraries listed below
Sorting:
- LAPACK development repository☆1,814Mar 7, 2026Updated last week
- BLAS-like Library Instantiation Software Framework☆2,612Nov 11, 2025Updated 4 months ago
- oneAPI Deep Neural Network Library (oneDNN)☆3,962Updated this week
- Open Machine Learning Compiler Framework☆13,174Updated this week
- ☆1,995Jul 29, 2023Updated 2 years ago
- CUDA Templates and Python DSLs for High-Performance Linear Algebra☆9,389Mar 7, 2026Updated last week
- The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologi…☆3,121Updated this week
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli…☆20,827Oct 25, 2023Updated 2 years ago
- Open standard for machine learning interoperability☆20,445Mar 7, 2026Updated last week
- Caffe: a fast open framework for deep learning.☆34,770Jul 31, 2024Updated last year
- Seamless operability between C++11 and Python☆17,757Mar 7, 2026Updated last week
- ncnn is a high-performance neural network inference framework optimized for the mobile platform☆22,868Updated this week
- Low-precision matrix multiplication☆1,832Jan 29, 2024Updated 2 years ago
- Development repository for the Triton language and compiler☆18,656Updated this week
- [ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl☆4,997Feb 8, 2024Updated 2 years ago
- Open MPI main development repository☆2,543Updated this week
- oneAPI Threading Building Blocks (oneTBB)☆6,582Updated this week
- Acceleration package for neural networks on multi-core CPUs☆1,702Jun 11, 2024Updated last year
- Tensors and Dynamic neural networks in Python with strong GPU acceleration☆98,039Mar 8, 2026Updated last week
- ArrayFire: a general purpose GPU library.☆4,868Mar 7, 2026Updated last week
- a language for fast, portable data-parallel computation☆6,591Updated this week
- Tuned OpenCL BLAS☆1,169Feb 1, 2026Updated last month
- The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.☆37,350Updated this week
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more☆35,038Updated this week
- ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator☆19,497Updated this week
- A toolkit for making real world machine learning and data analysis applications in C++☆14,362Updated this week
- BLISlab: A Sandbox for Optimizing GEMM☆559Jun 17, 2021Updated 4 years ago
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl☆1,820Oct 9, 2023Updated 2 years ago
- NumPy & SciPy for GPU☆10,824Mar 7, 2026Updated last week
- The official SuiteSparse library: a suite of sparse matrix algorithms authored or co-authored by Tim Davis, Texas A&M University.☆1,461Feb 10, 2026Updated last month
- DO NOT CHECK OUT THESE FILES FROM GITHUB UNLESS YOU KNOW WHAT YOU ARE DOING. (See below.)☆3,039Feb 16, 2026Updated 3 weeks ago
- Abseil Common Libraries (C++)☆17,100Updated this week
- C++ tensors with broadcasting and lazy computing☆3,712Jan 29, 2026Updated last month
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone…☆12,782Updated this week
- oneAPI Math Library (oneMath)☆747Updated this week
- Caffe2 is a lightweight, modular, and scalable deep learning framework.☆8,398Feb 7, 2023Updated 3 years ago
- header only, dependency-free deep learning framework in C++14☆6,020Apr 17, 2022Updated 3 years ago
- a software library containing BLAS functions written in OpenCL☆865Aug 2, 2024Updated last year
- A microbenchmark support library☆10,043Updated this week