Maratyszcza / NNPACK
Acceleration package for neural networks on multi-core CPUs
☆1,676Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for NNPACK
- Low-precision matrix multiplication☆1,780Updated 9 months ago
- ☆1,655Updated 6 years ago
- Easy benchmarking of all publicly accessible implementations of convnets☆2,683Updated 7 years ago
- Benchmarking Deep Learning operations on different hardware☆1,074Updated 3 years ago
- Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning☆1,110Updated 5 years ago
- OpenCL library to train deep convolutional neural networks☆869Updated 6 years ago
- This is a Experimental version of OpenCL by AMD Research, we now recommend you to use The official BVLC Caffe OpenCL branch is over at …☆518Updated 6 years ago
- A domain specific language to express machine learning workloads.☆1,761Updated last year
- Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators☆1,528Updated 5 years ago
- Fast Recurrent Networks Library☆576Updated 8 years ago
- This fork of BVLC/Caffe is dedicated to improving performance of this deep learning framework when running on CPU, in particular Intel® X…☆848Updated 2 years ago
- Automatically exported from code.google.com/p/cuda-convnet2☆784Updated 8 years ago
- ATen: A TENsor library for C++11☆683Updated 5 years ago
- oneAPI Deep Neural Network Library (oneDNN)☆3,636Updated this week
- Caffe: a fast open framework for deep learning.☆672Updated last year
- nGraph has moved to OpenVINO☆1,352Updated 4 years ago
- Deep Learning API and Server in C++14 support for PyTorch,TensorRT, Dlib, NCNN, Tensorflow, XGBoost and TSNE☆2,519Updated 2 weeks ago
- Compute Library for Deep Neural Networks (clDNN)☆574Updated last year
- Collective communications library with various primitives for multi-machine training.☆1,228Updated this week
- Intel® Nervana™ reference deep learning framework committed to best performance on all hardware☆3,873Updated 3 years ago
- ImageNet classification using binary Convolutional Neural Networks☆856Updated 6 years ago
- Tutorial code on how to build your own Deep Learning System in 2k Lines☆2,002Updated 6 years ago
- SqueezeNet: AlexNet-level accuracy with 50x fewer parameters☆2,179Updated 6 years ago
- Computation Graph Toolkit☆630Updated 6 years ago
- NumPy interface with mixed backend execution☆1,109Updated 6 years ago
- Minerva: a fast and flexible tool for deep learning on multi-GPU. It provides ndarray programming interface, just like Numpy. Python bind…☆699Updated 6 years ago
- Open single and half precision gemm implementations☆374Updated last year
- THE Deep Learning Benchmarks☆352Updated 8 years ago
- Winograd minimal convolution algorithm generator for convolutional neural networks.☆605Updated 4 years ago