OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.
☆7,289Feb 20, 2026Updated last week
Alternatives and similar repositories for OpenBLAS
Users that are interested in OpenBLAS are comparing it to the libraries listed below
Sorting:
- LAPACK development repository☆1,811Jan 20, 2026Updated last month
- BLAS-like Library Instantiation Software Framework☆2,610Nov 11, 2025Updated 3 months ago
- Open Machine Learning Compiler Framework☆13,142Updated this week
- oneAPI Deep Neural Network Library (oneDNN)☆3,956Updated this week
- ☆1,992Jul 29, 2023Updated 2 years ago
- CUDA Templates and Python DSLs for High-Performance Linear Algebra☆9,315Updated this week
- The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologi…☆3,116Feb 20, 2026Updated last week
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli…☆20,829Oct 25, 2023Updated 2 years ago
- Open standard for machine learning interoperability☆20,373Updated this week
- Caffe: a fast open framework for deep learning.☆34,839Jul 31, 2024Updated last year
- Seamless operability between C++11 and Python☆17,726Feb 17, 2026Updated last week
- ncnn is a high-performance neural network inference framework optimized for the mobile platform☆22,819Feb 20, 2026Updated last week
- Low-precision matrix multiplication☆1,831Jan 29, 2024Updated 2 years ago
- Development repository for the Triton language and compiler☆18,460Updated this week
- [ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl☆4,998Feb 8, 2024Updated 2 years ago
- Open MPI main development repository☆2,532Updated this week
- oneAPI Threading Building Blocks (oneTBB)☆6,557Feb 19, 2026Updated last week
- Acceleration package for neural networks on multi-core CPUs☆1,701Jun 11, 2024Updated last year
- ArrayFire: a general purpose GPU library.☆4,859Sep 5, 2025Updated 5 months ago
- Tensors and Dynamic neural networks in Python with strong GPU acceleration☆97,688Updated this week
- a language for fast, portable data-parallel computation☆6,577Updated this week
- Tuned OpenCL BLAS☆1,168Feb 1, 2026Updated 3 weeks ago
- The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.☆37,026Updated this week
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more☆34,940Updated this week
- ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator☆19,389Updated this week
- A toolkit for making real world machine learning and data analysis applications in C++☆14,349Feb 15, 2026Updated last week
- BLISlab: A Sandbox for Optimizing GEMM☆557Jun 17, 2021Updated 4 years ago
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl☆1,820Oct 9, 2023Updated 2 years ago
- NumPy & SciPy for GPU☆10,804Updated this week
- The official SuiteSparse library: a suite of sparse matrix algorithms authored or co-authored by Tim Davis, Texas A&M University.☆1,454Feb 10, 2026Updated 2 weeks ago
- DO NOT CHECK OUT THESE FILES FROM GITHUB UNLESS YOU KNOW WHAT YOU ARE DOING. (See below.)☆3,023Feb 16, 2026Updated last week
- Abseil Common Libraries (C++)☆17,052Updated this week
- C++ tensors with broadcasting and lazy computing☆3,704Jan 29, 2026Updated 3 weeks ago
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone…☆12,702Feb 13, 2026Updated 2 weeks ago
- oneAPI Math Library (oneMath)☆743Feb 19, 2026Updated last week
- Caffe2 is a lightweight, modular, and scalable deep learning framework.☆8,400Feb 7, 2023Updated 3 years ago
- header only, dependency-free deep learning framework in C++14☆6,017Apr 17, 2022Updated 3 years ago
- a software library containing BLAS functions written in OpenCL☆865Aug 2, 2024Updated last year
- A microbenchmark support library☆10,024Updated this week