A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research and is continuously improved. PRs adding works (papers, repositories) missing from the repo are welcome.
☆2,327 · Jan 29, 2026 · Updated last month
Alternatives and similar repositories for Awesome-Model-Quantization
Users interested in Awesome-Model-Quantization compare it to the libraries listed below.
- List of papers related to neural network quantization in recent AI conferences and journals. (☆805, updated 11 months ago)
- Model Quantization Benchmark. (☆858, updated 10 months ago)
- A list of papers, docs, and code about efficient AIGC. This repo aims to provide information for efficient AIGC research, including languag… (☆204, updated last year)
- Quantization library for PyTorch. Supports low-precision and mixed-precision quantization, with hardware implementation through TVM. (☆453, updated 2 years ago)
- PyTorch implementation of BRECQ (ICLR 2021). (☆290, updated 4 years ago)
- A curated list of neural network pruning resources. (☆2,492, updated last year)
- Summary and code for deep neural network quantization. (☆558, updated 8 months ago)
- [IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer. (☆360, updated 2 years ago)
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models. (☆1,612, updated last year)
- micronet, a model compression and deployment library. Compression includes quantization-aware training (QAT), High-Bit (>2b) (DoReFa/Quantiz… (☆2,271, updated 9 months ago)
- Unofficial implementation of LSQ-Net, a neural network quantization framework. (☆310, updated last year)
- [CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework. (☆281, updated 2 years ago)
- PPL Quantization Tool (PPQ), a powerful offline neural network quantization tool. (☆1,785, updated last year)
- PyTorch implementation of APoT quantization (ICLR 2020). (☆283, updated last year)
- AIMET, a library providing advanced quantization and compression techniques for trained neural network models. (☆2,563, updated this week)
- Awesome LLM compression research papers and tools. (☆1,780, updated this week)
- Post-training quantization for vision transformers. (☆238, updated 3 years ago)
- Official PyTorch implementation of the ICLR 2022 paper QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quan… (☆128, updated 5 months ago)
- PyTorch implementation of Data-Free Quantization Through Weight Equalization and Bias Correction. (☆263, updated 2 years ago)
- Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers". (☆2,261, updated last year)
- [ICML 2023] Official implementation of BiBench: Benchmarking and Analyzing Network Binar… (☆56, updated last year)
- [CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, vision foundation models, etc. (☆3,258, updated 5 months ago)
- Brevitas: neural network quantization in PyTorch. (☆1,488, updated this week)
- A simple network quantization demo using PyTorch from scratch. (☆542, updated 2 years ago)
- A curated list for Efficient Large Language Models. (☆1,954, updated 8 months ago)
- Awesome machine learning model compression research papers, quantization, tools, and learning material. (☆540, updated last year)
- [MLSys'25] QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving; [MLSys'25] LServe: Efficient Long-sequence LLM Se… (☆816, updated 11 months ago)
- [CVPR 2020] PyTorch implementation of the paper on forward and backward information retention for a… (☆181, updated 5 years ago)
- [CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision. (☆404, updated 5 years ago)
- [CVPR 2022] Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. (☆138, updated 3 years ago)
- [ICML'21 Oral] I-BERT: Integer-only BERT Quantization. (☆265, updated 3 years ago)
- [ICLR 2024 Spotlight] OmniQuant, a simple and powerful quantization technique for LLMs. (☆889, updated 3 months ago)
- Unofficial PyTorch implementation of Learned Step Size Quantization (LSQ), ICLR 2020. (☆139, updated 5 years ago)
- [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration. (☆3,441, updated 7 months ago)
- Reorder-based post-training quantization for large language models. (☆199, updated 2 years ago)
- [ECCV 2020] Code for Post-Training Piecewise Linear Quantization for Deep Neural Networks. (☆68, updated 4 years ago)
- [ECCV 2020] ReActNet: Towards Precise Binary Neural Network with Generalized Activation Functions. (☆263, updated 4 years ago)
- SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) and sparsity; leading model compression techniques on PyTorch, TensorFlow, … (☆2,590, updated this week)
- Code for the NeurIPS 2024 paper QuaRot: end-to-end 4-bit inference of large language models. (☆483, updated last year)
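Most of the post-training quantization works above refine the same baseline idea: map float weights to low-bit integers via a scale factor, then dequantize at compute time. Below is a minimal, stdlib-only sketch of symmetric per-tensor INT8 quantization — not taken from any listed repository; the function names `quantize_int8` and `dequantize` are illustrative.

```python
# Minimal sketch of symmetric per-tensor INT8 post-training quantization.
# Not from any listed repo; names are illustrative. Real libraries (e.g.
# PPQ, Brevitas, AIMET) add calibration, per-channel scales, and clipping.

def quantize_int8(weights):
    """Map float weights to int8 codes with a single symmetric scale."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127 if max_abs else 1.0  # 127 = int8 positive max
    q = [max(-128, min(127, round(w / scale))) for w in weights]
    return q, scale

def dequantize(q, scale):
    """Recover approximate float weights from int8 codes."""
    return [qi * scale for qi in q]

w = [0.5, -1.27, 0.003, 1.0]
q, s = quantize_int8(w)       # q = [50, -127, 0, 100]
w_hat = dequantize(q, s)      # each entry within s/2 of the original
```

The quantization error per weight is bounded by half the scale (`s / 2`); methods like BRECQ, GPTQ, and AWQ in the list above reduce the *task-level* impact of that error rather than the per-weight bound itself.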