tensorflow / model-optimization
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
☆1,523 · Updated 3 weeks ago
Alternatives and similar repositories for model-optimization:
Users who are interested in model-optimization are comparing it to the libraries listed below.
- TensorFlow/TensorRT integration ☆740 · Updated last year
- Tensorflow Backend for ONNX ☆1,295 · Updated 11 months ago
- Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators ☆1,536 · Updated 5 years ago
- Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distille… ☆4,371 · Updated last year
- Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX ☆2,376 · Updated 3 weeks ago
- A profiling and performance analysis tool for TensorFlow ☆367 · Updated this week
- Model analysis tools for TensorFlow ☆1,261 · Updated 2 weeks ago
- An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications. ☆2,802 · Updated last year
- Convert tf.keras/Keras models to ONNX ☆378 · Updated 3 years ago
- A performant and modular runtime for TensorFlow ☆759 · Updated last week
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models. ☆2,232 · Updated this week
- An Open-Source Library for Training Binarized Neural Networks ☆712 · Updated 6 months ago
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/ ☆1,264 · Updated this week
- Fast & Simple Resource-Constrained Learning of Deep Network Structure ☆1,026 · Updated last month
- Save, Load Frozen Graph and Run Inference From Frozen Graph in TensorFlow 1.x and 2.x ☆301 · Updated 4 years ago
- Low-precision matrix multiplication ☆1,792 · Updated last year
- Guide for building custom op for TensorFlow ☆378 · Updated last year
- TensorFlow Estimator ☆301 · Updated last year
- Useful extra functionality for TensorFlow 2.x maintained by SIG-addons ☆1,695 · Updated 6 months ago
- [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment ☆1,899 · Updated last year
- Mesh TensorFlow: Model Parallelism Made Easier ☆1,603 · Updated last year
- [ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware ☆1,433 · Updated 6 months ago
- PyTorch to Keras model converter ☆861 · Updated 2 years ago
- [ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices ☆439 · Updated last year
- Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO ☆721 · Updated this week
- QKeras: a quantization deep learning library for Tensorflow Keras ☆556 · Updated 2 weeks ago
- TVM integration into PyTorch ☆452 · Updated 5 years ago
- Papers for deep neural network compression and acceleration ☆396 · Updated 3 years ago
- Facebook AI Performance Evaluation Platform ☆390 · Updated 3 weeks ago
- High-efficiency floating-point neural network inference operators for mobile, server, and Web ☆1,968 · Updated this week