oneapi-src / oneDNN

oneAPI Deep Neural Network Library (oneDNN)

☆3,742

Alternatives and similar repositories for oneDNN:

Users that are interested in oneDNN are comparing it to the libraries listed below

NervanaSystems / ngraph
nGraph has moved to OpenVINO
☆1,350Updated 4 years ago
google / gemmlowp
Low-precision matrix multiplication
☆1,792Updated last year
apache / tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
☆12,081Updated this week
NVIDIA / nccl
Optimized primitives for collective multi-GPU communication
☆3,534Updated last month
dmlc / nnvm
☆1,657Updated 6 years ago
ARM-software / ComputeLibrary
The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologi…
☆2,928Updated this week
pytorch / FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
☆1,270Updated this week
facebookincubator / gloo
Collective communications library with various primitives for multi-machine training.
☆1,273Updated this week
pytorch / glow
Compiler for Neural Network hardware accelerators
☆3,269Updated 9 months ago
pytorch / QNNPACK
Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators
☆1,536Updated 5 years ago
mlcommons / training
Reference implementations of MLPerf™ training benchmarks
☆1,648Updated last month
Maratyszcza / NNPACK
Acceleration package for neural networks on multi-core CPUs
☆1,684Updated 8 months ago
facebookresearch / TensorComprehensions
A domain specific language to express machine learning workloads.
☆1,755Updated last year
tensorflow / mlir
"Multi-Level Intermediate Representation" Compiler Infrastructure
☆1,738Updated 3 years ago
OpenMathLib / OpenBLAS
OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.
☆6,619Updated this week
intel / caffe
This fork of BVLC/Caffe is dedicated to improving performance of this deep learning framework when running on CPU, in particular Intel® X…
☆847Updated 2 years ago
baidu-research / DeepBench
Benchmarking Deep Learning operations on different hardware
☆1,081Updated 3 years ago
onnx / onnx-tensorflow
Tensorflow Backend for ONNX
☆1,296Updated 11 months ago
onnx / tutorials
Tutorials for creating and using ONNX models
☆3,465Updated 7 months ago
ARM-software / armnn
Arm NN ML Software. The code here is a read-only mirror of https://review.mlplatform.org/admin/repos/ml/armnn
☆1,238Updated this week
onnx / onnx
Open standard for machine learning interoperability
☆18,575Updated this week
microsoft / MMdnn
MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Co…
☆5,807Updated 9 months ago
clab / dynet
DyNet: The Dynamic Neural Network Toolkit
☆3,426Updated last year
halide / Halide
a language for fast, portable data-parallel computation
☆5,985Updated this week
NVIDIA / cub
[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl
☆1,731Updated last year
libxsmm / libxsmm
Library for specialized dense and sparse matrix operations, and deep learning primitives.
☆865Updated this week
tensorflow / runtime
A performant and modular runtime for TensorFlow
☆759Updated 2 weeks ago
microsoft / nnfusion
A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.
☆978Updated 5 months ago
onnx / onnxmltools
ONNXMLTools enables conversion of models to ONNX
☆1,055Updated 2 months ago
mlcommons / inference
Reference implementations of MLPerf™ inference benchmarks
☆1,326Updated this week