uxlfoundation/oneDNN

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/uxlfoundation/oneDNN)

uxlfoundation / oneDNN

oneAPI Deep Neural Network Library (oneDNN)

☆4,025

Alternatives and similar repositories for oneDNN

Users that are interested in oneDNN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

apache / tvm
View on GitHub
Open Machine Learning Compiler Framework
☆13,607Updated this week
uxlfoundation / oneMath
View on GitHub
oneAPI Math Library (oneMath)
☆770Jul 13, 2026Updated last week
uxlfoundation / oneDAL
View on GitHub
oneAPI Data Analytics Library (oneDAL)
☆651Updated this week
NervanaSystems / ngraph
View on GitHub
nGraph has moved to OpenVINO
☆1,344Oct 15, 2020Updated 5 years ago
ARM-software / ComputeLibrary
View on GitHub
The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologi…
☆3,174Updated this week
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
google / gemmlowp
View on GitHub
Low-precision matrix multiplication
☆1,845Jan 29, 2024Updated 2 years ago
pytorch / glow
View on GitHub
Compiler for Neural Network hardware accelerators
☆3,321May 11, 2024Updated 2 years ago
Maratyszcza / NNPACK
View on GitHub
Acceleration package for neural networks on multi-core CPUs
☆1,709Jun 11, 2024Updated 2 years ago
libxsmm / libxsmm
View on GitHub
Library for specialized dense and sparse matrix operations, and deep learning primitives.
☆968Updated this week
pytorch / FBGEMM
View on GitHub
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
☆1,571Updated this week
uxlfoundation / oneCCL
View on GitHub
oneAPI Collective Communications Library (oneCCL)
☆268Updated this week
NVIDIA / TensorRT
View on GitHub
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone…
☆13,182Jul 7, 2026Updated 2 weeks ago
openvinotoolkit / openvino
View on GitHub
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
☆10,564Updated this week
intel / intel-extension-for-pytorch
View on GitHub
A Python package for extending the official PyTorch that can easily obtain performance on Intel platform
☆2,014Mar 30, 2026Updated 3 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
flame / how-to-optimize-gemm
View on GitHub
☆2,022Jul 29, 2023Updated 2 years ago
onnx / onnx
View on GitHub
Open standard for machine learning interoperability
☆21,211Updated this week
microsoft / nnfusion
View on GitHub
A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.
☆1,002Sep 19, 2024Updated last year
intel / caffe
View on GitHub
This fork of BVLC/Caffe is dedicated to improving performance of this deep learning framework when running on CPU, in particular Intel® X…
☆848Aug 4, 2022Updated 3 years ago
OpenMathLib / OpenBLAS
View on GitHub
OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.
☆7,529Updated this week
NVIDIA / cutlass
View on GitHub
CUDA Templates and Python DSLs for High-Performance Linear Algebra
☆10,123Updated this week
intel / clDNN
View on GitHub
Compute Library for Deep Neural Networks (clDNN)
☆576Jan 10, 2023Updated 3 years ago
iree-org / iree
View on GitHub
A retargetable MLIR-based machine learning compiler and runtime toolkit.
☆3,853Updated this week
pytorch / QNNPACK
View on GitHub
Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators
☆1,550Aug 28, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
triton-lang / triton
View on GitHub
Development repository for the Triton language and compiler
☆19,778Updated this week
dmlc / dlpack
View on GitHub
common in-memory tensor structure
☆1,232Jun 19, 2026Updated last month
apache / mxnet
View on GitHub
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli…
☆20,820Oct 25, 2023Updated 2 years ago
uxlfoundation / oneTBB
View on GitHub
oneAPI Threading Building Blocks (oneTBB)
☆6,702Updated this week
llvm / torch-mlir
View on GitHub
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
☆1,872Updated this week
halide / Halide
View on GitHub
a language for fast, portable data-parallel computation
☆6,568Updated this week
herumi / xbyak
View on GitHub
A JIT assembler for x86/x64 architectures supporting FPU, MMX, SSE (1-4), AVX (1-2, 512), APX, and AVX10.2
☆2,260Jul 14, 2026Updated last week
NVIDIA / nccl
View on GitHub
Optimized primitives for collective multi-GPU communication
☆4,904Updated this week
dmlc / nnvm
View on GitHub
☆1,650Sep 11, 2018Updated 7 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
alibaba / BladeDISC
View on GitHub
BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.
☆932Dec 30, 2024Updated last year
google / XNNPACK
View on GitHub
High-efficiency floating-point neural network inference operators for mobile, server, and Web
☆2,403Updated this week
Tencent / ncnn
View on GitHub
ncnn is a high-performance neural network inference framework optimized for the mobile platform
☆23,580Updated this week
uxlfoundation / oneDPL
View on GitHub
oneAPI DPC++ Library (oneDPL) https://software.intel.com/content/www/us/en/develop/tools/oneapi/components/dpc-library.html
☆778Updated this week
horovod / horovod
View on GitHub
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
☆14,693Jun 20, 2026Updated last month
microsoft / onnxruntime
View on GitHub
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
☆21,185Updated this week
intel / MLSL
View on GitHub
Intel(R) Machine Learning Scaling Library is a library providing an efficient implementation of communication patterns used in deep learn…
☆108Jan 7, 2023Updated 3 years ago