Maratyszcza / NNPACK
Acceleration package for neural networks on multi-core CPUs
☆1,681Updated 7 months ago
Alternatives and similar repositories for NNPACK:
Users that are interested in NNPACK are comparing it to the libraries listed below
- ☆1,656Updated 6 years ago
- Low-precision matrix multiplication☆1,787Updated 11 months ago
- Easy benchmarking of all publicly accessible implementations of convnets☆2,682Updated 7 years ago
- Benchmarking Deep Learning operations on different hardware☆1,077Updated 3 years ago
- A domain specific language to express machine learning workloads.☆1,759Updated last year
- Fast Recurrent Networks Library☆579Updated 8 years ago
- OpenCL library to train deep convolutional neural networks☆872Updated 7 years ago
- Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators☆1,533Updated 5 years ago
- Caffe: a fast open framework for deep learning.☆674Updated last year
- Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning☆1,111Updated 5 years ago
- The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs☆1,267Updated 9 months ago
- nGraph has moved to OpenVINO☆1,350Updated 4 years ago
- oneAPI Deep Neural Network Library (oneDNN)☆3,677Updated this week
- This is a Experimental version of OpenCL by AMD Research, we now recommend you to use The official BVLC Caffe OpenCL branch is over at …☆519Updated 6 years ago
- Compute Library for Deep Neural Networks (clDNN)☆574Updated 2 years ago
- ATen: A TENsor library for C++11☆690Updated 5 years ago
- Automatically exported from code.google.com/p/cuda-convnet2☆785Updated 9 years ago
- Open single and half precision gemm implementations☆373Updated last year
- Intel® Nervana™ reference deep learning framework committed to best performance on all hardware☆3,870Updated 4 years ago
- Collective communications library with various primitives for multi-machine training.☆1,253Updated 2 weeks ago
- Compiler for Neural Network hardware accelerators☆3,257Updated 8 months ago
- Tutorial code on how to build your own Deep Learning System in 2k Lines☆2,006Updated 6 years ago
- This fork of BVLC/Caffe is dedicated to improving performance of this deep learning framework when running on CPU, in particular Intel® X…☆846Updated 2 years ago
- ImageNet classification using binary Convolutional Neural Networks☆857Updated 7 years ago
- THE Deep Learning Benchmarks☆351Updated 8 years ago
- Winograd minimal convolution algorithm generator for convolutional neural networks.☆610Updated 4 years ago
- Some handy utility libraries and tools for the Caffe deep learning framework.☆457Updated 5 years ago
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/☆1,240Updated this week
- Assembler for NVIDIA Maxwell architecture☆963Updated 2 years ago