Model Compression Toolkit (MCT) is an open source project for optimizing neural network models for deployment on efficient, resource-constrained hardware. It provides researchers, developers, and engineers with advanced quantization and compression tools for deploying state-of-the-art neural networks.
☆431 · Updated this week
Alternatives and similar repositories for mct-model-optimization
Users who are interested in mct-model-optimization are comparing it to the libraries listed below.
- ☆23 · Feb 10, 2026 · Updated 2 weeks ago
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX ☆178 · Feb 19, 2026 · Updated last week
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models. ☆2,563 · Updated this week
- ☆28 · Oct 21, 2020 · Updated 5 years ago
- A model compression and acceleration toolbox based on PyTorch. ☆333 · Jan 12, 2024 · Updated 2 years ago
- Post-training sparsity-aware quantization ☆34 · Feb 26, 2023 · Updated 3 years ago
- The code for our paper "Neural Architecture Search as Program Transformation Exploration" ☆16 · Apr 28, 2021 · Updated 4 years ago
- FakeQuantize with Learned Step Size (LSQ+) as Observer in PyTorch ☆37 · Dec 18, 2021 · Updated 4 years ago
- [IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer ☆360 · Apr 11, 2023 · Updated 2 years ago
- Quantization library for PyTorch. Supports low-precision and mixed-precision quantization, with hardware implementation through TVM. ☆453 · May 15, 2023 · Updated 2 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming ☆35 · Jun 29, 2023 · Updated 2 years ago
- Neural Network Quantization With Fractional Bit-widths ☆11 · Feb 19, 2021 · Updated 5 years ago
- Model Quantization Benchmark ☆858 · Apr 20, 2025 · Updated 10 months ago
- A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are co… ☆2,327 · Jan 29, 2026 · Updated last month
- The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quan… ☆128 · Sep 23, 2025 · Updated 5 months ago
- ☆23 · Oct 7, 2021 · Updated 4 years ago
- Brevitas: neural network quantization in PyTorch ☆1,488 · Updated this week
- PyTorch implementation for the APoT quantization (ICLR 2020) ☆283 · Dec 11, 2024 · Updated last year
- BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models. ☆58 · Feb 7, 2023 · Updated 3 years ago
- Offline quantization tools for deployment. ☆142 · Dec 28, 2023 · Updated 2 years ago
- PyTorch implementation of BRECQ, ICLR 2021 ☆290 · Aug 1, 2021 · Updated 4 years ago
- [CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc. ☆3,258 · Sep 7, 2025 · Updated 5 months ago
- BitSplit Post-training Quantization ☆50 · Dec 20, 2021 · Updated 4 years ago
- [ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization ☆93 · May 5, 2022 · Updated 3 years ago
- ☆79 · Jul 21, 2022 · Updated 3 years ago
- HandLandmark Detection that can be performed only in onnxruntime. Pre-focusing by skeletal detection is not performed. This does not use … ☆20 · Apr 30, 2024 · Updated last year
- The official code for [ECCV2020] "HALO: Hardware-aware Learning to Optimize" ☆10 · Mar 22, 2023 · Updated 2 years ago
- RepGhost: A Hardware-Efficient Ghost Module via Re-parameterization ☆183 · Aug 15, 2023 · Updated 2 years ago
- [ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning ☆20 · Jul 7, 2022 · Updated 3 years ago
- [CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework ☆281 · Dec 8, 2023 · Updated 2 years ago
- The PyTorch implementation of Learned Step size Quantization (LSQ) in ICLR2020 (unofficial) ☆139 · Nov 19, 2020 · Updated 5 years ago
- Provides some new architectures, channel pruning, and quantization methods for YOLOv5 ☆31 · Oct 13, 2025 · Updated 4 months ago
- ☆45 · Jul 14, 2021 · Updated 4 years ago
- Code for the ICLR2020 "Training Binary Neural Networks with Real-to-Binary Convolutions" ☆34 · Jun 16, 2020 · Updated 5 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming ☆98 · Jun 10, 2021 · Updated 4 years ago
- Neural Architecture Search for Neural Network Libraries ☆61 · Jan 22, 2024 · Updated 2 years ago
- Awesome Quantization Paper lists with Codes ☆10 · Feb 24, 2021 · Updated 5 years ago
- ☆57 · Dec 8, 2020 · Updated 5 years ago
- ☆342 · Feb 12, 2026 · Updated 2 weeks ago