mit-han-lab/haq

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mit-han-lab/haq)

mit-han-lab / haq

[CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision

☆408

Alternatives and similar repositories for haq

Users that are interested in haq are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mit-han-lab / amc
View on GitHub
[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices
☆449Nov 22, 2023Updated 2 years ago
mit-han-lab / apq
View on GitHub
[CVPR 2020] APQ: Joint Search for Network Architecture, Pruning and Quantization Policy
☆160Jun 16, 2020Updated 6 years ago
zhaoweicai / EdMIPS
View on GitHub
PyTorch implementation of EdMIPS: https://arxiv.org/pdf/2004.05795.pdf
☆61Jul 27, 2020Updated 5 years ago
Zhen-Dong / HAWQ
View on GitHub
Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.
☆462May 15, 2023Updated 3 years ago
amirgholami / ZeroQ
View on GitHub
[CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework
☆280Dec 8, 2023Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
mit-han-lab / proxylessnas
View on GitHub
[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
☆1,446Aug 30, 2024Updated last year
peiswang / BitSplit
View on GitHub
BitSplit Post-trining Quantization
☆49Dec 20, 2021Updated 4 years ago
deJQK / FracBits
View on GitHub
Neural Network Quantization With Fractional Bit-widths
☆11Feb 19, 2021Updated 5 years ago
microsoft / LQ-Nets
View on GitHub
LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks
☆245Aug 30, 2022Updated 3 years ago
EECS-NTNU / bismo
View on GitHub
BISMO: A Scalable Bit-Serial Matrix Multiplication Overlay for Reconfigurable Computing
☆150Dec 25, 2019Updated 6 years ago
submission2019 / cnn-quantization
View on GitHub
Quantization of Convolutional Neural networks.
☆250Aug 5, 2024Updated last year
mit-han-lab / amc-models
View on GitHub
[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices
☆168Feb 26, 2021Updated 5 years ago
zhijian-liu / torchprofile
View on GitHub
Count the MACs / FLOPs of PyTorch models
☆643Mar 11, 2026Updated 4 months ago
yhhhli / APoT_Quantization
View on GitHub
PyTorch implementation for the APoT quantization (ICLR 2020)
☆288Dec 11, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
facebookresearch / kill-the-bits
View on GitHub
Code for: "And the bit goes down: Revisiting the quantization of neural networks"
☆630Nov 9, 2020Updated 5 years ago
ricky40403 / DSQ
View on GitHub
pytorch implementation of "Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks"
☆131Jan 2, 2020Updated 6 years ago
yanghr / BSQ
View on GitHub
BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)
☆41Jan 12, 2021Updated 5 years ago
Mxbonn / INQ-pytorch
View on GitHub
A PyTorch implementation of "Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights"
☆165Mar 8, 2020Updated 6 years ago
mrusci / training-mixed-precision-quantized-networks
View on GitHub
This repository containts the pytorch scripts to train mixed-precision networks for microcontroller deployment, based on the memory contr…
☆51May 9, 2024Updated 2 years ago
mit-han-lab / once-for-all
View on GitHub
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
☆1,953Dec 14, 2023Updated 2 years ago
yhhhli / BRECQ
View on GitHub
Pytorch implementation of BRECQ, ICLR 2021
☆300Aug 1, 2021Updated 4 years ago
ZiweiWangTHU / GMPQ
View on GitHub
This is the pytorch implementation for the paper: Generalizable Mixed-Precision Quantization via Attribution Rank Preservation, which is…
☆24Aug 17, 2021Updated 4 years ago
JiahuiYu / slimmable_networks
View on GitHub
Slimmable Networks, AutoSlim, and Beyond, ICLR 2019, and ICCV 2019
☆929Mar 9, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
hsharma35 / bitfusion
View on GitHub
Simulator for BitFusion
☆103Aug 6, 2020Updated 5 years ago
itayhubara / CalibTIP
View on GitHub
Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming
☆97Jun 10, 2021Updated 5 years ago
zqu1992 / ALQ
View on GitHub
☆14Oct 24, 2022Updated 3 years ago
csyhhu / Awesome-Deep-Neural-Network-Compression
View on GitHub
Summary, Code for Deep Neural Network Quantization
☆562Updated this week
666DZY666 / micronet
View on GitHub
micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantiz…
☆2,266May 6, 2025Updated last year
1adrianb / binary-nas
View on GitHub
☆35Mar 4, 2020Updated 6 years ago
mit-han-lab / hardware-aware-transformers
View on GitHub
[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
☆336Jul 14, 2024Updated 2 years ago
xiezheng-cs / DTQ
View on GitHub
PyTorch implementation of "Deep Transferring Quantization" (ECCV2020)
☆18Jun 22, 2022Updated 4 years ago
zhutmost / lsq-net
View on GitHub
Unofficial implementation of LSQ-Net, a neural network quantization framework
☆316May 8, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Jzz24 / pytorch_quantization
View on GitHub
A pytorch implementation of dorefa quantization
☆114Dec 30, 2019Updated 6 years ago
antspy / quantized_distillation
View on GitHub
Implements quantized distillation. Code for our paper "Model compression via distillation and quantization"
☆336Jul 25, 2024Updated 2 years ago
liuzechun / MetaPruning
View on GitHub
MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning. In ICCV 2019.
☆352Jul 5, 2020Updated 6 years ago
hustzxd / LSQuantization
View on GitHub
The PyTorch implementation of Learned Step size Quantization (LSQ) in ICLR2020 (unofficial)
☆139Nov 19, 2020Updated 5 years ago
jakc4103 / scale-adjusted-training
View on GitHub
PyTorch implementation of Towards Efficient Training for Neural Network Quantization
☆16Jan 16, 2020Updated 6 years ago
htqin / IR-Net
View on GitHub
[CVPR 2020] This project is the PyTorch implementation of our accepted CVPR 2020 paper : forward and backward information retention for a…
☆181Mar 14, 2020Updated 6 years ago
jun-fang / PWLQ
View on GitHub
Code for our paper at ECCV 2020: Post-Training Piecewise Linear Quantization for Deep Neural Networks
☆68Nov 4, 2021Updated 4 years ago