A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
☆1,571Apr 14, 2026Updated this week
Alternatives and similar repositories for model-optimization
Users that are interested in model-optimization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.☆2,910Mar 31, 2023Updated 3 years ago
- Useful extra functionality for TensorFlow 2.x maintained by SIG-addons☆1,706Sep 4, 2025Updated 7 months ago
- TensorFlow/TensorRT integration☆743Nov 30, 2023Updated 2 years ago
- Model analysis tools for TensorFlow☆1,265Aug 6, 2025Updated 8 months ago
- TFX is an end-to-end platform for deploying production ML pipelines☆2,181Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A profiling and performance analysis tool for machine learning☆500Updated this week
- QKeras: a quantization deep learning library for Tensorflow Keras☆580Feb 23, 2026Updated last month
- A Hyperparameter Tuning Library for Keras☆2,925Dec 1, 2025Updated 4 months ago
- An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model c…☆14,346Jul 3, 2024Updated last year
- Open Machine Learning Compiler Framework☆13,268Apr 13, 2026Updated last week
- Lingvo☆2,860Mar 30, 2026Updated 2 weeks ago
- Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX☆2,525Apr 2, 2026Updated 2 weeks ago
- TensorFlow Estimator☆298Jan 23, 2024Updated 2 years ago
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone…☆12,902Updated this week
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Guide for building custom op for TensorFlow☆385Mar 23, 2023Updated 3 years ago
- Making text a first-class citizen in TensorFlow.☆1,284Updated this week
- TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...☆4,554Mar 27, 2026Updated 3 weeks ago
- A flexible, high-performance serving system for machine learning models☆6,350Updated this week
- Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators☆1,548Aug 28, 2019Updated 6 years ago
- A performant and modular runtime for TensorFlow☆753Sep 4, 2025Updated 7 months ago
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,689Dec 1, 2025Updated 4 months ago
- Visualizer for neural network, deep learning and machine learning models☆32,736Apr 13, 2026Updated last week
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,592Apr 11, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Reference models and tools for Cloud TPUs.☆5,275Mar 25, 2026Updated 3 weeks ago
- micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantiz…☆2,269May 6, 2025Updated 11 months ago
- Low-precision matrix multiplication☆1,838Jan 29, 2024Updated 2 years ago
- MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Co…☆5,812Aug 7, 2025Updated 8 months ago
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT☆2,966Updated this week
- Fast and flexible AutoML with learning guarantees.☆3,456Nov 30, 2023Updated 2 years ago
- A library for transfer learning by reusing parts of TensorFlow models.☆3,523Jan 17, 2025Updated last year
- Simplify your onnx model☆4,321Apr 7, 2026Updated last week
- A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep lear…☆5,672Updated this week
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Input pipeline framework☆990Aug 6, 2025Updated 8 months ago
- Mesh TensorFlow: Model Parallelism Made Easier☆1,625Nov 17, 2023Updated 2 years ago
- Google Brain AutoML☆6,464Mar 2, 2025Updated last year
- Compiler for Neural Network hardware accelerators☆3,326May 11, 2024Updated last year
- Code for: "And the bit goes down: Revisiting the quantization of neural networks"☆631Nov 9, 2020Updated 5 years ago
- AutoML library for deep learning☆9,315Nov 25, 2025Updated 4 months ago
- Library for exploring and validating machine learning data☆779Mar 23, 2026Updated 3 weeks ago