tensorflow / model-optimization
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
☆1,493Updated this week
Related projects ⓘ
Alternatives and complementary repositories for model-optimization
- TensorFlow/TensorRT integration☆737Updated 11 months ago
- Tensorflow Backend for ONNX☆1,285Updated 7 months ago
- Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators☆1,527Updated 5 years ago
- Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX☆2,320Updated 2 months ago
- Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distille…☆4,351Updated last year
- Useful extra functionality for TensorFlow 2.x maintained by SIG-addons☆1,692Updated 2 months ago
- Fast & Simple Resource-Constrained Learning of Deep Network Structure☆1,020Updated 5 months ago
- Tutorials for creating and using ONNX models☆3,372Updated 3 months ago
- ONNXMLTools enables conversion of models to ONNX☆1,019Updated 5 months ago
- QKeras: a quantization deep learning library for Tensorflow Keras☆539Updated 2 weeks ago
- An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.☆2,786Updated last year
- [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment☆1,881Updated 10 months ago
- [ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware☆1,425Updated 2 months ago
- A performant and modular runtime for TensorFlow☆756Updated 3 weeks ago
- Convert tf.keras/Keras models to ONNX☆381Updated 3 years ago
- A profiling and performance analysis tool for TensorFlow☆359Updated this week
- Guide for building custom op for TensorFlow☆378Updated last year
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT☆2,585Updated this week
- PyTorch to Keras model convertor☆857Updated last year
- Actively maintained ONNX Optimizer☆645Updated 8 months ago
- ONNX-TensorRT: TensorRT backend for ONNX☆2,948Updated this week
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,138Updated this week
- Convert ONNX model graph to Keras model format.☆195Updated 4 months ago
- The convertor/conversion of deep learning models for different deep learning frameworks/softwares.☆3,243Updated last year
- Library for faster pinned CPU <-> GPU transfer in Pytorch☆683Updated 4 years ago
- TVM integration into PyTorch☆453Updated 4 years ago
- Code for: "And the bit goes down: Revisiting the quantization of neural networks"☆636Updated 4 years ago
- Rethinking the Value of Network Pruning (Pytorch) (ICLR 2019)☆1,508Updated 4 years ago
- Save, Load Frozen Graph and Run Inference From Frozen Graph in TensorFlow 1.x and 2.x☆300Updated 3 years ago
- MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Co…☆5,797Updated 5 months ago