ARM-software / ComputeLibrary
The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
☆3,101 Updated last week
Alternatives and similar repositories for ComputeLibrary
Users who are interested in ComputeLibrary are comparing it to the libraries listed below.
- Arm NN ML Software. ☆1,292 Updated last month
- An open optimized software library project for the ARM® Architecture ☆1,525 Updated 3 years ago
- Low-precision matrix multiplication ☆1,827 Updated last year
- oneAPI Deep Neural Network Library (oneDNN) ☆3,953 Updated this week
- C++ image processing and machine learning library using SIMD: SSE, AVX, AVX-512, AMX for x86/x64, NEON for ARM. ☆2,228 Updated this week
- High-efficiency floating-point neural network inference operators for mobile, server, and Web ☆2,223 Updated this week
- Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators ☆1,551 Updated 6 years ago
- Acceleration package for neural networks on multi-core CPUs ☆1,703 Updated last year
- Tuned OpenCL BLAS ☆1,163 Updated last month
- ☆1,977 Updated 2 years ago
- nGraph has moved to OpenVINO ☆1,346 Updated 5 years ago
- Makes ARM NEON documentation accessible (with examples) ☆406 Updated last year
- Compiler for Neural Network hardware accelerators ☆3,327 Updated last year
- Arm Machine Learning tutorials and examples ☆479 Updated this week
- ☆156 Updated 10 months ago
- Khronos OpenCL-Headers ☆745 Updated last week
- The platform independent header allowing to compile any C/C++ code containing ARM NEON intrinsic functions for x86 target systems using S… ☆481 Updated 2 months ago
- Embedded and mobile deep learning research resources ☆761 Updated 2 years ago
- Khronos OpenVX Tutorial Material ☆246 Updated 4 years ago
- Tengine is a lite, high-performance, modular inference engine for embedded devices ☆4,506 Updated 10 months ago
- High-performance cross-platform inference engine; you can run Anakin on x86-cpu, arm, nv-gpu, amd-gpu, bitmain and cambricon devices. ☆535 Updated 3 years ago
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl ☆1,814 Updated 2 years ago
- Bolt is a deep learning library with high performance and heterogeneous flexibility. ☆955 Updated 9 months ago
- This fork of BVLC/Caffe is dedicated to improving performance of this deep learning framework when running on CPU, in particular Intel® X… ☆852 Updated 3 years ago
- FeatherCNN is a high performance inference engine for convolutional neural networks. ☆1,225 Updated 6 years ago
- Winograd minimal convolution algorithm generator for convolutional neural networks. ☆627 Updated 5 years ago
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/ ☆1,516 Updated this week
- Source code examples from the Parallel Forall Blog ☆1,319 Updated 3 months ago
- Benchmarking Neural Network Inference on Mobile Devices ☆384 Updated 2 years ago
- OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version. ☆7,213 Updated this week