aim-uofa/model-quantization

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/aim-uofa/model-quantization)

aim-uofa / model-quantization

Collections of model quantization algorithms. Any issues, please contact Peng Chen (blueardour@gmail.com)

☆45

Alternatives and similar repositories for model-quantization

Users that are interested in model-quantization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ziplab / QTool
View on GitHub
Collections of model quantization algorithms. Any issues, please contact Peng Chen (blueardour@gmail.com)
☆73Oct 7, 2021Updated 4 years ago
deJQK / AdaBits
View on GitHub
☆41Dec 15, 2022Updated 3 years ago
xiezheng-cs / DTQ
View on GitHub
PyTorch implementation of "Deep Transferring Quantization" (ECCV2020)
☆18Jun 22, 2022Updated 4 years ago
billhhh / FQSR
View on GitHub
Codes for ACMMM 2021 paper "Fully Quantized Image Super-Resolution Networks".
☆20Jul 25, 2021Updated 4 years ago
EunhyeokPark / PROFIT
View on GitHub
☆49Jan 21, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
zqu1992 / ALQ
View on GitHub
☆14Oct 24, 2022Updated 3 years ago
HuangOwen / QAT-ACS
View on GitHub
[TMLR] Official PyTorch implementation of paper "Efficient Quantization-aware Training with Adaptive Coreset Selection"
☆38Aug 20, 2024Updated last year
papers-submission / CalibTIP
View on GitHub
Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming
☆36Jun 29, 2023Updated 3 years ago
bohanzhuang / Group-Net-image-classification
View on GitHub
Structured Binary Neural Networks for Image Recognition
☆18Nov 18, 2021Updated 4 years ago
ziplab / QLLM
View on GitHub
[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…
☆31Mar 12, 2024Updated 2 years ago
hustzxd / LSQuantization
View on GitHub
The PyTorch implementation of Learned Step size Quantization (LSQ) in ICLR2020 (unofficial)
☆139Nov 19, 2020Updated 5 years ago
ThisisBillhe / torch_quantizer
View on GitHub
torch_quantizer is a out-of-box quantization tool for PyTorch models on CUDA backend, specially optimized for Diffusion Models.
☆25Mar 29, 2024Updated 2 years ago
cvlab-yonsei / EWGS
View on GitHub
An official implementation of "Network Quantization with Element-wise Gradient Scaling" (CVPR 2021) in PyTorch.
☆97Jul 14, 2023Updated 3 years ago
yukang2017 / NAS-quantization
View on GitHub
The code for Joint Neural Architecture Search and Quantization
☆14Apr 10, 2019Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
bohanzhuang / Towards-Effective-Low-bitwidth-Convolutional-Neural-Networks
View on GitHub
This repository implements the paper "Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations"
☆20Aug 30, 2021Updated 4 years ago
ricky40403 / DSQ
View on GitHub
pytorch implementation of "Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks"
☆131Jan 2, 2020Updated 6 years ago
ModelTC / mqbench-paper
View on GitHub
☆45Jul 14, 2021Updated 5 years ago
ysbsb / awesome-quantization
View on GitHub
Awesome Quantization Paper lists with Codes
☆10Feb 24, 2021Updated 5 years ago
colorjam / PAMS
View on GitHub
PArameterized Max Scale
☆60Dec 27, 2021Updated 4 years ago
ZiweiWangTHU / GMPQ
View on GitHub
This is the pytorch implementation for the paper: Generalizable Mixed-Precision Quantization via Attribution Rank Preservation, which is…
☆24Aug 17, 2021Updated 4 years ago
zhutmost / lsq-net
View on GitHub
Unofficial implementation of LSQ-Net, a neural network quantization framework
☆315May 8, 2024Updated 2 years ago
itayhubara / CalibTIP
View on GitHub
Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming
☆97Jun 10, 2021Updated 5 years ago
aredden / torch-bnb-fp4
View on GitHub
Faster Pytorch bitsandbytes 4bit fp4 nn.Linear ops
☆30Mar 16, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ModelTC / QLLM
View on GitHub
[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…
☆39Mar 11, 2024Updated 2 years ago
cvlab-yonsei / DAQ
View on GitHub
An official PyTorch implementation of the paper "Distance-aware Quantization", ICCV 2021.
☆48Nov 1, 2024Updated last year
yhhhli / APoT_Quantization
View on GitHub
PyTorch implementation for the APoT quantization (ICLR 2020)
☆288Dec 11, 2024Updated last year
SteveTsui / ReBNN
View on GitHub
☆12Nov 17, 2023Updated 2 years ago
SteveTsui / RBONN
View on GitHub
☆16Nov 25, 2022Updated 3 years ago
MXHX7199 / ICCV_2021_AFP
View on GitHub
AFP is a hardware-friendly quantization framework for DNNs, which is contributed by Fangxin Liu and Wenbo Zhao.
☆13Nov 8, 2021Updated 4 years ago
bigaidream-projects / role-kd
View on GitHub
Role-Wise Data Augmentation for Knowledge Distillation
☆19Nov 22, 2022Updated 3 years ago
lihuantong / HAST
View on GitHub
☆14Oct 6, 2023Updated 2 years ago
BillAmihom / RAPQ
View on GitHub
Pytorch implementation of RAPQ, IJCAI 2022
☆23Jul 19, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
jakc4103 / scale-adjusted-training
View on GitHub
PyTorch implementation of Towards Efficient Training for Neural Network Quantization
☆16Jan 16, 2020Updated 6 years ago
doegar / pwcforeveryone
View on GitHub
☆13Oct 30, 2018Updated 7 years ago
Qualcomm-AI-research / BayesianBits
View on GitHub
☆22Feb 11, 2022Updated 4 years ago
amirgholami / ZeroQ
View on GitHub
[CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework
☆280Dec 8, 2023Updated 2 years ago
GATECH-EIC / Auto-NBA
View on GitHub
[ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga…
☆16Jan 3, 2022Updated 4 years ago
ali-chr / Semantic-aware-Knowledge-Distillation-for-Few-ShotClass-Incremental-Learning
View on GitHub
CVPR2021
☆12Mar 29, 2021Updated 5 years ago
deJQK / FracBits
View on GitHub
Neural Network Quantization With Fractional Bit-widths
☆11Feb 19, 2021Updated 5 years ago