Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. This project provides researchers, developers, and engineers advanced quantization and compression tools for deploying state-of-the-art neural networks.
☆434Mar 25, 2026Updated 2 weeks ago
Alternatives and similar repositories for mct-model-optimization
Users that are interested in mct-model-optimization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆23Mar 2, 2026Updated last month
- ☆18Apr 4, 2025Updated last year
- Raspberry Pi AI Camera (IMX500) Model Zoo☆139Jul 22, 2025Updated 8 months ago
- ☆19Mar 6, 2026Updated last month
- ☆48Jul 28, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX☆180Mar 25, 2026Updated 2 weeks ago
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,585Apr 4, 2026Updated last week
- FakeQuantize with Learned Step Size(LSQ+) as Observer in PyTorch☆37Dec 18, 2021Updated 4 years ago
- ☆28Oct 21, 2020Updated 5 years ago
- Post-training sparsity-aware quantization☆34Feb 26, 2023Updated 3 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆36Jun 29, 2023Updated 2 years ago
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.☆458May 15, 2023Updated 2 years ago
- A model compression and acceleration toolbox based on pytorch.☆332Jan 12, 2024Updated 2 years ago
- [IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer☆359Apr 11, 2023Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are co…☆2,343Apr 5, 2026Updated last week
- The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quan…☆129Sep 23, 2025Updated 6 months ago
- The code for our paper "Neural Architecture Search as Program Transformation Exploration"☆16Apr 28, 2021Updated 4 years ago
- Model Quantization Benchmark☆864Apr 20, 2025Updated 11 months ago
- ☆13Feb 12, 2026Updated 2 months ago
- Neural Network Quantization With Fractional Bit-widths☆11Feb 19, 2021Updated 5 years ago
- The PyTorch implementation of Learned Step size Quantization (LSQ) in ICLR2020 (unofficial)☆138Nov 19, 2020Updated 5 years ago
- Pytorch implementation of BRECQ, ICLR 2021☆296Aug 1, 2021Updated 4 years ago
- Brevitas: neural network quantization in PyTorch☆1,512Apr 4, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.☆57Feb 7, 2023Updated 3 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆98Jun 10, 2021Updated 4 years ago
- ☆23Oct 7, 2021Updated 4 years ago
- [CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.☆3,278Sep 7, 2025Updated 7 months ago
- Fully Quantized Neural Networks For Speech Enhancement☆63Feb 15, 2024Updated 2 years ago
- PyTorch implementation for the APoT quantization (ICLR 2020)☆286Dec 11, 2024Updated last year
- Awesome Quantization Paper lists with Codes☆10Feb 24, 2021Updated 5 years ago
- HandLandmark Detection that can be performed only in onnxruntime. Pre-focusing by skeletal detection is not performed. This does not use …☆20Apr 30, 2024Updated last year
- ☆341Feb 12, 2026Updated last month
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Visualize machine learning models with Netron in VSCode☆17Nov 23, 2025Updated 4 months ago
- ☆45Jul 14, 2021Updated 4 years ago
- ☆171Mar 9, 2023Updated 3 years ago
- Offline Quantization Tools for Deploy.☆144Dec 28, 2023Updated 2 years ago
- Plugins for Neural Network Console.☆17Aug 29, 2025Updated 7 months ago
- Neural Architecture Search for Neural Network Libraries☆62Jan 22, 2024Updated 2 years ago
- BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)☆42Jan 12, 2021Updated 5 years ago