Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. This project provides researchers, developers, and engineers advanced quantization and compression tools for deploying state-of-the-art neural networks.
☆448Apr 23, 2026Updated 2 months ago
Alternatives and similar repositories for mct-model-optimization
Users that are interested in mct-model-optimization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆22Mar 2, 2026Updated 4 months ago
- Fully quantized Neural Networks for Audio Source Separation☆17Aug 11, 2024Updated last year
- ☆19May 15, 2026Updated last month
- ☆49Jul 28, 2020Updated 5 years ago
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX☆191Jun 10, 2026Updated 3 weeks ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,647Updated this week
- FakeQuantize with Learned Step Size(LSQ+) as Observer in PyTorch☆38Dec 18, 2021Updated 4 years ago
- ☆28Oct 21, 2020Updated 5 years ago
- Post-training sparsity-aware quantization☆34Feb 26, 2023Updated 3 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆36Jun 29, 2023Updated 3 years ago
- A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are co…☆2,397May 11, 2026Updated last month
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.☆461May 15, 2023Updated 3 years ago
- A model compression and acceleration toolbox based on pytorch.☆331Jan 12, 2024Updated 2 years ago
- [IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer☆359Apr 11, 2023Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quan…☆131Sep 23, 2025Updated 9 months ago
- The code for our paper "Neural Architecture Search as Program Transformation Exploration"☆17Apr 28, 2021Updated 5 years ago
- Model Quantization Benchmark☆868Apr 20, 2025Updated last year
- Neural Network Quantization With Fractional Bit-widths☆11Feb 19, 2021Updated 5 years ago
- The PyTorch implementation of Learned Step size Quantization (LSQ) in ICLR2020 (unofficial)☆139Nov 19, 2020Updated 5 years ago
- Pytorch implementation of BRECQ, ICLR 2021☆300Aug 1, 2021Updated 4 years ago
- Brevitas: neural network quantization in PyTorch☆1,543Jun 23, 2026Updated last week
- BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.☆58Feb 7, 2023Updated 3 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆98Jun 10, 2021Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆23Oct 7, 2021Updated 4 years ago
- [CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.☆3,322Sep 7, 2025Updated 9 months ago
- Code that accompanies the paper Bayesian Uncertainty for Gradient Aggregation in Multi-Task Learning - Accepted to ICML2024☆15May 8, 2025Updated last year
- PyTorch implementation for the APoT quantization (ICLR 2020)☆288Dec 11, 2024Updated last year
- Awesome Quantization Paper lists with Codes☆10Feb 24, 2021Updated 5 years ago
- HandLandmark Detection that can be performed only in onnxruntime. Pre-focusing by skeletal detection is not performed. This does not use …☆21Apr 30, 2024Updated 2 years ago
- Fully Quantized Neural Networks For Speech Enhancement☆65Feb 15, 2024Updated 2 years ago
- ☆344Feb 12, 2026Updated 4 months ago
- Visualize machine learning models with Netron in VSCode☆19Apr 22, 2026Updated 2 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆45Jul 14, 2021Updated 4 years ago
- ☆173Mar 9, 2023Updated 3 years ago
- Offline Quantization Tools for Deploy.☆143Dec 28, 2023Updated 2 years ago
- Plugins for Neural Network Console.☆16Aug 29, 2025Updated 10 months ago
- Neural Architecture Search for Neural Network Libraries☆64Jan 22, 2024Updated 2 years ago
- BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)☆41Jan 12, 2021Updated 5 years ago
- Library for rain estimation and detection built with PyTorch. This library provides an implementation of algorithms for extracting rain-r…☆12Jan 10, 2026Updated 5 months ago