tensorflow / model-optimization
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
☆1,493Updated this week
Related projects ⓘ
Alternatives and complementary repositories for model-optimization
- TensorFlow/TensorRT integration☆736Updated 11 months ago
- Tensorflow Backend for ONNX☆1,284Updated 7 months ago
- Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators☆1,528Updated 5 years ago
- A performant and modular runtime for TensorFlow☆756Updated last month
- Fast & Simple Resource-Constrained Learning of Deep Network Structure☆1,021Updated 6 months ago
- [ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware☆1,425Updated 2 months ago
- Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO☆706Updated this week
- An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.☆2,788Updated last year
- A profiling and performance analysis tool for TensorFlow☆360Updated this week
- Useful extra functionality for TensorFlow 2.x maintained by SIG-addons☆1,694Updated 2 months ago
- [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment☆1,884Updated 11 months ago
- Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distille…☆4,351Updated last year
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,148Updated this week
- A list of high-quality (newest) AutoML works and lightweight models including 1.) Neural Architecture Search, 2.) Lightweight Structures,…☆847Updated 3 years ago
- AutoML tools chain☆843Updated last year
- Guide for building custom op for TensorFlow☆378Updated last year
- Convert tf.keras/Keras models to ONNX☆381Updated 3 years ago
- ONNXMLTools enables conversion of models to ONNX☆1,024Updated 5 months ago
- TensorFlow Estimator☆304Updated 9 months ago
- Pruning and other network surgery for trained Keras models.☆405Updated 11 months ago
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT☆2,597Updated this week
- Rethinking the Value of Network Pruning (Pytorch) (ICLR 2019)☆1,510Updated 4 years ago
- PyTorch Implementation of [1611.06440] Pruning Convolutional Neural Networks for Resource Efficient Inference☆875Updated 5 years ago
- Model analysis tools for TensorFlow☆1,256Updated 2 weeks ago
- ☆662Updated 3 years ago
- ONNX-TensorRT: TensorRT backend for ONNX☆2,953Updated 2 weeks ago
- Code for: "And the bit goes down: Revisiting the quantization of neural networks"☆636Updated 4 years ago
- Low-precision matrix multiplication☆1,780Updated 9 months ago
- An Open-Source Library for Training Binarized Neural Networks☆707Updated 3 months ago
- Dive into Deep Learning Compiler☆643Updated 2 years ago