A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (papers, repositories) that are missed by the repo.
☆2,377May 11, 2026Updated last week
Alternatives and similar repositories for Awesome-Model-Quantization
Users that are interested in Awesome-Model-Quantization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- List of papers related to neural network quantization in recent AI conferences and journals.☆822Mar 27, 2025Updated last year
- A list of papers, docs, codes about efficient AIGC. This repo is aimed to provide the info for efficient AIGC research, including languag…☆205Feb 10, 2025Updated last year
- Model Quantization Benchmark☆866Apr 20, 2025Updated last year
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.☆462May 15, 2023Updated 3 years ago
- Pytorch implementation of BRECQ, ICLR 2021☆299Aug 1, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A curated list of neural network pruning resources.☆2,493Apr 4, 2024Updated 2 years ago
- [IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer☆361Apr 11, 2023Updated 3 years ago
- Summary, Code for Deep Neural Network Quantization☆562May 13, 2026Updated last week
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆1,647Jul 12, 2024Updated last year
- Unofficial implementation of LSQ-Net, a neural network quantization framework☆314May 8, 2024Updated 2 years ago
- micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantiz…☆2,274May 6, 2025Updated last year
- [CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework☆280Dec 8, 2023Updated 2 years ago
- PyTorch implementation for the APoT quantization (ICLR 2020)☆287Dec 11, 2024Updated last year
- PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.☆1,796Mar 28, 2024Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quan…☆131Sep 23, 2025Updated 7 months ago
- Awesome LLM compression research papers and tools.☆1,833Feb 23, 2026Updated 2 months ago
- Post-Training Quantization for Vision transformers.