tensorflow / model-optimization
A toolkit for optimizing Keras and TensorFlow ML models for deployment, including quantization and pruning.
☆1,532 Updated 2 months ago
Alternatives and similar repositories for model-optimization:
Users interested in model-optimization compare it to the libraries listed below.
- TensorFlow Backend for ONNX ☆1,302 Updated last year
- TensorFlow/TensorRT integration ☆741 Updated last year
- A profiling and performance analysis tool for machine learning ☆372 Updated this week
- Fast & Simple Resource-Constrained Learning of Deep Network Structure ☆1,029 Updated 2 months ago
- Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators ☆1,539 Updated 5 years ago
- A list of high-quality (newest) AutoML works and lightweight models including 1.) Neural Architecture Search, 2.) Lightweight Structures,… ☆853 Updated 3 years ago
- Convert tf.keras/Keras models to ONNX ☆378 Updated 3 years ago
- Low-precision matrix multiplication ☆1,799 Updated last year
- Convert TensorFlow, Keras, TensorFlow.js and TFLite models to ONNX ☆2,404 Updated 2 months ago
- [ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware ☆1,439 Updated 7 months ago
- QKeras: a quantization deep learning library for TensorFlow Keras ☆563 Updated 3 weeks ago
- Code for: "And the bit goes down: Revisiting the quantization of neural networks" ☆633 Updated 4 years ago
- Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distille… ☆4,389 Updated 2 years ago
- Bolt is a deep learning library with high performance and heterogeneous flexibility. ☆940 Updated 2 weeks ago
- A performant and modular runtime for TensorFlow ☆759 Updated last week
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models. ☆2,282 Updated this week
- Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO ☆724 Updated this week
- TensorFlow Code for paper "Efficient Neural Architecture Search via Parameter Sharing" ☆1,580 Updated 5 years ago
- An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications. ☆2,868 Updated 2 years ago
- AutoML tools chain ☆850 Updated 2 years ago
- Arm NN ML Software. The code here is a read-only mirror of https://review.mlplatform.org/admin/repos/ml/armnn ☆1,251 Updated this week
- [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment ☆1,908 Updated last year
- Guide for building custom op for TensorFlow ☆381 Updated 2 years ago
- TensorFlow Estimator ☆301 Updated last year
- Library for faster pinned CPU <-> GPU transfer in PyTorch ☆685 Updated 5 years ago
- ☆669 Updated 3 years ago
- An Open-Source Library for Training Binarized Neural Networks ☆715 Updated 8 months ago
- PyTorch Implementation of [1611.06440] Pruning Convolutional Neural Networks for Resource Efficient Inference ☆879 Updated 5 years ago
- Save, Load Frozen Graph and Run Inference From Frozen Graph in TensorFlow 1.x and 2.x ☆302 Updated 4 years ago
- Useful extra functionality for TensorFlow 2.x maintained by SIG-addons ☆1,698 Updated 3 weeks ago