Maratyszcza / NNPACK
Acceleration package for neural networks on multi-core CPUs
☆1,687 · Updated 10 months ago
Alternatives and similar repositories for NNPACK:
Users interested in NNPACK are comparing it to the libraries listed below:
- ☆1,658 · Updated 6 years ago
- Low-precision matrix multiplication ☆1,800 · Updated last year
- Easy benchmarking of all publicly accessible implementations of convnets ☆2,683 · Updated 7 years ago
- Benchmarking Deep Learning operations on different hardware ☆1,083 · Updated 4 years ago
- A domain specific language to express machine learning workloads. ☆1,759 · Updated 2 years ago
- Matrix Shadow: Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning ☆1,110 · Updated 5 years ago
- Fast Recurrent Networks Library ☆577 · Updated 8 years ago
- Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators ☆1,540 · Updated 5 years ago
- This is an experimental version of OpenCL by AMD Research; we now recommend that you use the official BVLC Caffe OpenCL branch, which is over at … ☆522 · Updated 6 years ago
- ImageNet classification using binary Convolutional Neural Networks ☆858 · Updated 7 years ago
- Caffe: a fast open framework for deep learning. ☆672 · Updated 2 years ago
- Compute Library for Deep Neural Networks (clDNN) ☆574 · Updated 2 years ago
- nGraph has moved to OpenVINO ☆1,349 · Updated 4 years ago
- OpenCL library to train deep convolutional neural networks ☆875 · Updated 7 years ago
- ATen: A TENsor library for C++11 ☆694 · Updated 5 years ago
- THE Deep Learning Benchmarks ☆351 · Updated 8 years ago
- Open single and half precision gemm implementations ☆381 · Updated 2 years ago
- Intel® Nervana™ reference deep learning framework committed to best performance on all hardware ☆3,877 · Updated 4 years ago
- SqueezeNet: AlexNet-level accuracy with 50x fewer parameters ☆2,195 · Updated 6 years ago
- oneAPI Deep Neural Network Library (oneDNN) ☆3,783 · Updated this week
- Some handy utility libraries and tools for the Caffe deep learning framework. ☆458 · Updated 6 years ago
- Tutorial code on how to build your own Deep Learning System in 2k Lines ☆2,012 · Updated 6 years ago
- Automatically exported from code.google.com/p/cuda-convnet2 ☆800 · Updated 9 years ago
- This fork of BVLC/Caffe is dedicated to improving performance of this deep learning framework when running on CPU, in particular Intel® X… ☆848 · Updated 2 years ago
- Facebook's extensions to torch/cunn. ☆1,062 · Updated 7 years ago
- Computation Graph Toolkit ☆633 · Updated 7 years ago
- Collective communications library with various primitives for multi-machine training. ☆1,294 · Updated this week
- A common bricks library for building scalable and portable distributed machine learning. ☆868 · Updated last week
- The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs ☆1,293 · Updated 3 weeks ago
- Torch on steroids ☆994 · Updated 6 years ago