pytorch / QNNPACK
Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators
☆1,538Updated 5 years ago
Alternatives and similar repositories for QNNPACK:
Users that are interested in QNNPACK are comparing it to the libraries listed below
- An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.☆2,857Updated 2 years ago
- Low-precision matrix multiplication☆1,798Updated last year
- A domain specific language to express machine learning workloads.☆1,759Updated last year
- Code for: "And the bit goes down: Revisiting the quantization of neural networks"☆633Updated 4 years ago
- Bolt is a deep learning library with high performance and heterogeneous flexibility.☆940Updated this week
- Acceleration package for neural networks on multi-core CPUs☆1,687Updated 10 months ago
- High performance Cross-platform Inference-engine, you could run Anakin on x86-cpu,arm, nv-gpu, amd-gpu,bitmain and cambricon devices.☆533Updated 2 years ago
- Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distille…☆4,387Updated last year
- nGraph has moved to OpenVINO☆1,350Updated 4 years ago
- FeatherCNN is a high performance inference engine for convolutional neural networks.☆1,217Updated 5 years ago
- TVM integration into PyTorch☆452Updated 5 years ago
- Memory consumption and FLOP count estimates for convnets☆918Updated 6 years ago
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/☆1,293Updated this week
- Tensorflow Backend for ONNX☆1,301Updated last year
- [ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware☆1,436Updated 7 months ago
- WeChat: NeuralTalk,Weekly report and awesome list of embedded-ai.☆378Updated 2 years ago
- Benchmarking Neural Network Inference on Mobile Devices☆370Updated 2 years ago
- ☆1,659Updated 6 years ago
- Winograd minimal convolution algorithm generator for convolutional neural networks.☆614Updated 4 years ago
- Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)☆1,084Updated 11 months ago
- Fast & Simple Resource-Constrained Learning of Deep Network Structure☆1,028Updated 2 months ago
- A performant and modular runtime for TensorFlow☆759Updated last month
- ImageNet classification using binary Convolutional Neural Networks☆857Updated 7 years ago
- Embedded and mobile deep learning research resources☆746Updated 2 years ago
- High-efficiency floating-point neural network inference operators for mobile, server, and Web☆1,998Updated this week
- PyTorch Implementation of [1611.06440] Pruning Convolutional Neural Networks for Resource Efficient Inference☆878Updated 5 years ago
- Mobile AI Compute Engine Model Zoo☆376Updated 3 years ago
- Generate a quantization parameter file for ncnn framework int8 inference☆519Updated 4 years ago
- A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.☆1,530Updated 2 months ago
- Benchmarking Deep Learning operations on different hardware☆1,082Updated 3 years ago