Maratyszcza / NNPACKLinks
Acceleration package for neural networks on multi-core CPUs
☆1,695Updated last year
Alternatives and similar repositories for NNPACK
Users that are interested in NNPACK are comparing it to the libraries listed below
Sorting:
- ☆1,656Updated 6 years ago
- Low-precision matrix multiplication☆1,811Updated last year
- Easy benchmarking of all publicly accessible implementations of convnets☆2,681Updated 8 years ago
- Benchmarking Deep Learning operations on different hardware☆1,094Updated 4 years ago
- Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning☆1,119Updated 6 years ago
- Fast Recurrent Networks Library☆577Updated 8 years ago
- nGraph has moved to OpenVINO☆1,344Updated 4 years ago
- Caffe: a fast open framework for deep learning.☆671Updated 2 years ago
- A domain specific language to express machine learning workloads.☆1,760Updated 2 years ago
- OpenCL library to train deep convolutional neural networks☆874Updated 7 years ago
- Automatically exported from code.google.com/p/cuda-convnet2☆807Updated 9 years ago
- Compute Library for Deep Neural Networks (clDNN)☆574Updated 2 years ago
- This fork of BVLC/Caffe is dedicated to improving performance of this deep learning framework when running on CPU, in particular Intel® X…☆850Updated 3 years ago
- This is a Experimental version of OpenCL by AMD Research, we now recommend you to use The official BVLC Caffe OpenCL branch is over at …☆528Updated 6 years ago
- ATen: A TENsor library for C++11☆702Updated 5 years ago
- ImageNet classification using binary Convolutional Neural Networks☆863Updated 7 years ago
- Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators☆1,543Updated 5 years ago
- Some handy utility libraries and tools for the Caffe deep learning framework.☆460Updated 6 years ago
- Open single and half precision gemm implementations☆383Updated 2 years ago
- A common bricks library for building scalable and portable distributed machine learning.☆877Updated last week
- Winograd minimal convolution algorithm generator for convolutional neural networks.☆619Updated 4 years ago
- SqueezeNet: AlexNet-level accuracy with 50x fewer parameters☆2,202Updated 7 years ago
- Minerva: a fast and flexible tool for deep learning on multi-GPU. It provides ndarray programming interface, just like Numpy. Python bind…☆705Updated 6 years ago
- A CUDNN minimal deep learning training code sample using LeNet.☆268Updated 2 years ago
- Python module for performing basic dense linear algebra computations on the GPU using CUDA.☆609Updated 5 years ago
- A REST API for Caffe using Docker and Go☆419Updated 7 years ago
- THE Deep Learning Benchmarks☆351Updated 8 years ago
- Collective communications library with various primitives for multi-machine training.☆1,342Updated 3 weeks ago
- Intel® Nervana™ reference deep learning framework committed to best performance on all hardware☆3,869Updated 4 years ago
- Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1☆1,057Updated 6 years ago