pytorch / QNNPACK
Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators
☆1,550 Updated 6 years ago
Alternatives and similar repositories for QNNPACK
Users who are interested in QNNPACK are comparing it to the libraries listed below.
- Low-precision matrix multiplication ☆1,832 Updated 2 years ago
- FeatherCNN is a high performance inference engine for convolutional neural networks. ☆1,228 Updated 6 years ago
- High performance Cross-platform Inference-engine, you could run Anakin on x86-cpu, arm, nv-gpu, amd-gpu, bitmain and cambricon devices. ☆536 Updated 3 years ago
- An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications. ☆2,912 Updated 2 years ago
- Bolt is a deep learning library with high performance and heterogeneous flexibility. ☆956 Updated 9 months ago
- Tensorflow Backend for ONNX ☆1,325 Updated last year
- Fast & Simple Resource-Constrained Learning of Deep Network Structure ☆1,032 Updated last week
- TVM integration into PyTorch ☆456 Updated 6 years ago
- Code for: "And the bit goes down: Revisiting the quantization of neural networks" ☆631 Updated 5 years ago
- Embedded and mobile deep learning research resources ☆761 Updated 2 years ago
- Memory consumption and FLOP count estimates for convnets ☆931 Updated 7 years ago
- A domain specific language to express machine learning workloads. ☆1,765 Updated 2 years ago
- nGraph has moved to OpenVINO ☆1,346 Updated 5 years ago
- Winograd minimal convolution algorithm generator for convolutional neural networks. ☆627 Updated this week
- Benchmarking Neural Network Inference on Mobile Devices ☆386 Updated 2 years ago
- Mobile AI Compute Engine Model Zoo ☆375 Updated 4 years ago
- [ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware ☆1,448 Updated last year
- Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17) ☆1,086 Updated last year
- Acceleration package for neural networks on multi-core CPUs ☆1,703 Updated last year
- ImageNet classification using binary Convolutional Neural Networks ☆866 Updated 8 years ago
- A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning. ☆1,562 Updated this week
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/ ☆1,525 Updated this week
- TensorFlow/TensorRT integration ☆743 Updated 2 years ago
- ☆1,655 Updated 7 years ago
- Dive into Deep Learning Compiler ☆644 Updated 3 years ago
- Arm NN ML Software. ☆1,296 Updated 2 weeks ago
- WeChat: NeuralTalk, Weekly report and awesome list of embedded-ai. ☆379 Updated 3 years ago
- High-efficiency floating-point neural network inference operators for mobile, server, and Web ☆2,245 Updated this week
- PyTorch Implementation of [1611.06440] Pruning Convolutional Neural Networks for Resource Efficient Inference ☆886 Updated 6 years ago
- Facebook AI Performance Evaluation Platform ☆392 Updated 3 weeks ago