Model Compression Toolkit (MCT) is an open-source project for optimizing neural network models for deployment on efficient, constrained hardware. It provides researchers, developers, and engineers with advanced quantization and compression tools for deploying state-of-the-art neural networks.
☆432 · Mar 19, 2026 · Updated this week
Alternatives and similar repositories for mct-model-optimization
Users who are interested in mct-model-optimization are comparing it to the libraries listed below.
- ☆23 · Mar 2, 2026 · Updated 2 weeks ago
- ☆48 · Jul 28, 2020 · Updated 5 years ago
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX ☆179 · Mar 10, 2026 · Updated last week
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models. ☆2,566 · Mar 14, 2026 · Updated last week
- FakeQuantize with Learned Step Size(LSQ+) as Observer in PyTorch ☆37 · Dec 18, 2021 · Updated 4 years ago
- ☆28 · Oct 21, 2020 · Updated 5 years ago
- Post-training sparsity-aware quantization ☆34 · Feb 26, 2023 · Updated 3 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming ☆35 · Jun 29, 2023 · Updated 2 years ago
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM. ☆454 · May 15, 2023 · Updated 2 years ago
- A model compression and acceleration toolbox based on pytorch. ☆331 · Jan 12, 2024 · Updated 2 years ago
- [IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer ☆357 · Apr 11, 2023 · Updated 2 years ago
- A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are co… ☆2,334 · Jan 29, 2026 · Updated last month
- The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quan… ☆128 · Sep 23, 2025 · Updated 5 months ago
- The code for our paper "Neural Architecture Search as Program Transformation Exploration" ☆16 · Apr 28, 2021 · Updated 4 years ago
- Model Quantization Benchmark ☆862 · Apr 20, 2025 · Updated 11 months ago
- Neural Network Quantization With Fractional Bit-widths ☆11 · Feb 19, 2021 · Updated 5 years ago
- The PyTorch implementation of Learned Step size Quantization (LSQ) in ICLR2020 (unofficial) ☆139 · Nov 19, 2020 · Updated 5 years ago
- Pytorch implementation of BRECQ, ICLR 2021 ☆292 · Aug 1, 2021 · Updated 4 years ago
- Brevitas: neural network quantization in PyTorch ☆1,502 · Updated this week
- BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models. ☆57 · Feb 7, 2023 · Updated 3 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming ☆98 · Jun 10, 2021 · Updated 4 years ago
- ☆23 · Oct 7, 2021 · Updated 4 years ago
- [CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc. ☆3,267 · Sep 7, 2025 · Updated 6 months ago
- Code that accompanies the paper Bayesian Uncertainty for Gradient Aggregation in Multi-Task Learning - Accepted to ICML2024 ☆15 · May 8, 2025 · Updated 10 months ago
- Fully Quantized Neural Networks For Speech Enhancement ☆63 · Feb 15, 2024 · Updated 2 years ago
- PyTorch implementation for the APoT quantization (ICLR 2020) ☆287 · Dec 11, 2024 · Updated last year
- Awesome Quantization Paper lists with Codes ☆10 · Feb 24, 2021 · Updated 5 years ago
- HandLandmark Detection that can be performed only in onnxruntime. Pre-focusing by skeletal detection is not performed. This does not use … ☆20 · Apr 30, 2024 · Updated last year
- ☆340 · Feb 12, 2026 · Updated last month
- Visualize machine learning models with Netron in VSCode ☆16 · Nov 23, 2025 · Updated 3 months ago
- ☆45 · Jul 14, 2021 · Updated 4 years ago
- ☆169 · Mar 9, 2023 · Updated 3 years ago
- Offline Quantization Tools for Deploy. ☆143 · Dec 28, 2023 · Updated 2 years ago
- Plugins for Neural Network Console. ☆17 · Aug 29, 2025 · Updated 6 months ago
- Neural Architecture Search for Neural Network Libraries ☆61 · Jan 22, 2024 · Updated 2 years ago
- BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021) ☆42 · Jan 12, 2021 · Updated 5 years ago
- Library for rain estimation and detection built with PyTorch. This library provides an implementation of algorithms for extracting rain-r… ☆12 · Jan 10, 2026 · Updated 2 months ago
- A tool to modify ONNX models in a visualization fashion, based on Netron and Flask. ☆1,617 · Nov 19, 2025 · Updated 4 months ago
- [CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework ☆282 · Dec 8, 2023 · Updated 2 years ago