Efficient-ML / Awesome-Model-Quantization
A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (papers, repositories) that are missed by the repo.
☆1,872Updated last week
Related projects ⓘ
Alternatives and complementary repositories for Awesome-Model-Quantization
- List of papers related to neural network quantization in recent AI conferences and journals.☆453Updated last month
- A curated list of neural network pruning resources.☆2,356Updated 7 months ago
- Awesome machine learning model compression research papers, quantization, tools, and learning material.☆487Updated last month
- Collection of recent methods on (deep) neural network compression and acceleration.☆927Updated 2 months ago
- Model Quantization Benchmark☆762Updated 5 months ago
- Summary, Code for Deep Neural Network Quantization☆530Updated 3 weeks ago
- [CVPR 2023] DepGraph: Towards Any Structural Pruning☆2,704Updated 3 weeks ago
- micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantiz…☆2,217Updated 3 years ago
- PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.☆1,551Updated 7 months ago
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.☆413Updated last year
- Pytorch implementation of various Knowledge Distillation (KD) methods.☆1,611Updated 2 years ago
- A simple network quantization demo using pytorch from scratch.☆508Updated last year
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,142Updated this week
- Neural Network Compression Framework for enhanced OpenVINO™ inference☆940Updated this week
- [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment☆1,881Updated 10 months ago
- A coding-free framework built on PyTorch for reproducible deep learning studies. 🏆25 knowledge distillation methods presented at CVPR, I…☆1,391Updated 3 weeks ago
- Rethinking the Value of Network Pruning (Pytorch) (ICLR 2019)☆1,508Updated 4 years ago
- ☆660Updated 3 years ago
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆1,245Updated 4 months ago
- Efficient computing methods developed by Huawei Noah's Ark Lab☆1,199Updated last week
- Brevitas: neural network quantization in PyTorch☆1,199Updated this week
- Awesome LLM compression research papers and tools.☆1,177Updated this week
- [IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer☆308Updated last year
- Papers for deep neural network compression and acceleration☆395Updated 3 years ago
- knowledge distillation papers☆741Updated last year
- Awesome Knowledge-Distillation. 分类整理的知识蒸馏paper(2014-2021)。☆2,489Updated last year
- [CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision☆369Updated 3 years ago
- Model analyzer in PyTorch☆1,466Updated last year
- PyTorch library to facilitate development and standardized evaluation of neural network pruning methods.☆424Updated last year
- OpenMMLab Model Compression Toolbox and Benchmark.☆1,472Updated 5 months ago