tensorflow / model-optimization
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
☆1,525Updated last month
Alternatives and similar repositories for model-optimization:
Users that are interested in model-optimization are comparing it to the libraries listed below
- TensorFlow/TensorRT integration☆739Updated last year
- Tensorflow Backend for ONNX☆1,296Updated 11 months ago
- Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators☆1,536Updated 5 years ago
- A profiling and performance analysis tool for TensorFlow☆369Updated this week
- Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distille…☆4,374Updated last year
- Useful extra functionality for TensorFlow 2.x maintained by SIG-addons☆1,695Updated 6 months ago
- Code for: "And the bit goes down: Revisiting the quantization of neural networks"☆634Updated 4 years ago
- Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX☆2,381Updated last month
- Fast & Simple Resource-Constrained Learning of Deep Network Structure☆1,027Updated last month
- QKeras: a quantization deep learning library for Tensorflow Keras☆557Updated last month
- Model analysis tools for TensorFlow☆1,262Updated last month
- [ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware☆1,432Updated 6 months ago
- A performant and modular runtime for TensorFlow☆759Updated 3 weeks ago
- Convert tf.keras/Keras models to ONNX☆378Updated 3 years ago
- An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.☆2,807Updated last year
- ONNXMLTools enables conversion of models to ONNX☆1,055Updated 2 months ago
- Bolt is a deep learning library with high performance and heterogeneous flexibility.☆940Updated 7 months ago
- Save, Load Frozen Graph and Run Inference From Frozen Graph in TensorFlow 1.x and 2.x☆301Updated 4 years ago
- A list of high-quality (newest) AutoML works and lightweight models including 1.) Neural Architecture Search, 2.) Lightweight Structures,…☆851Updated 3 years ago
- Rethinking the Value of Network Pruning (Pytorch) (ICLR 2019)☆1,515Updated 4 years ago
- [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment☆1,901Updated last year
- Low-precision matrix multiplication☆1,792Updated last year
- Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO☆722Updated last week
- ☆666Updated 3 years ago
- Pruning and other network surgery for trained Keras models.☆408Updated last year
- An Open-Source Library for Training Binarized Neural Networks☆713Updated 7 months ago
- Papers for deep neural network compression and acceleration☆396Updated 3 years ago
- AutoML tools chain☆849Updated 2 years ago
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/☆1,271Updated this week
- TensorFlow Estimator☆301Updated last year