kssteven418/I-BERT

[ICML'21 Oral] I-BERT: Integer-only BERT Quantization

☆269

Alternatives and similar repositories for I-BERT

Users who are interested in I-BERT are comparing it to the repositories listed below.

kssteven418 / Q-ASR
[ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition
☆34 · Oct 11, 2021 · Updated 4 years ago
zkkli / I-ViT
[ICCV 2023] I-ViT: Integer-only Quantization for Efficient Vision Transformer Inference
☆207 · Sep 2, 2024 · Updated last year
hahnyuan / PTQ4ViT
Post-Training Quantization for Vision Transformers.
☆242 · Jul 19, 2022 · Updated 4 years ago
amirgholami / ZeroQ
[CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework
☆280 · Dec 8, 2023 · Updated 2 years ago
Zhen-Dong / HAWQ
Quantization library for PyTorch. Supports low-precision and mixed-precision quantization, with hardware implementation through TVM.
☆462 · May 15, 2023 · Updated 3 years ago
megvii-research / FQ-ViT
[IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer
☆359 · Apr 11, 2023 · Updated 3 years ago
yhhhli / APoT_Quantization
PyTorch implementation for the APoT quantization (ICLR 2020)
☆288 · Dec 11, 2024 · Updated last year
yhhhli / BRECQ
PyTorch implementation of BRECQ, ICLR 2021
☆300 · Aug 1, 2021 · Updated 4 years ago
itayhubara / CalibTIP
Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming
☆97 · Jun 10, 2021 · Updated 5 years ago
Qualcomm-AI-research / transformer-quantization
☆211 · Nov 9, 2021 · Updated 4 years ago
GATECH-EIC / ViTCoD
[HPCA 2023] ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design
☆133 · Jun 27, 2023 · Updated 3 years ago
deJQK / FracBits
Neural Network Quantization With Fractional Bit-widths
☆11 · Feb 19, 2021 · Updated 5 years ago
peiswang / BitSplit
BitSplit Post-training Quantization
☆49 · Dec 20, 2021 · Updated 4 years ago
sIncerass / QBERT
☆15 · Oct 26, 2022 · Updated 3 years ago
wimh966 / outlier_suppression
The official PyTorch implementation of the NeurIPS2022 (spotlight) paper, Outlier Suppression: Pushing the Limit of Low-bit Transformer L…
☆49 · Oct 5, 2022 · Updated 3 years ago
1157942086 / CVPR2020_Auxiliary_Quantization
Training Quantized Neural Networks with a Full-precision Auxiliary Module
☆13 · Jun 19, 2020 · Updated 6 years ago
moranshkolnik / RobustQuantization
Source code for the paper "Robust Quantization: One Model to Rule Them All"
☆42 · Mar 24, 2023 · Updated 3 years ago
liuzechun / Nonuniform-to-Uniform-Quantization
Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022.
☆139 · Apr 28, 2022 · Updated 4 years ago
Guangxuan-Xiao / torch-int
This repository contains integer operators on GPUs for PyTorch.
☆235 · Sep 29, 2023 · Updated 2 years ago
zhutmost / lsq-net
Unofficial implementation of LSQ-Net, a neural network quantization framework
☆315 · May 8, 2024 · Updated 2 years ago
cjf00000 / StatQuant
Code for the paper "A Statistical Framework for Low-bitwidth Training of Deep Neural Networks"
☆29 · Oct 31, 2020 · Updated 5 years ago
wimh966 / QDrop
The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quan…
☆131 · Sep 23, 2025 · Updated 9 months ago
zhexinli / Q-ViT-DeiT
DeiT implementation for Q-ViT
☆26 · Apr 21, 2025 · Updated last year
mit-han-lab / smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
☆1,670 · Jul 12, 2024 · Updated 2 years ago
hustzxd / LSQuantization
The PyTorch implementation of Learned Step size Quantization (LSQ) in ICLR2020 (unofficial)
☆139 · Nov 19, 2020 · Updated 5 years ago
yaozhewei / HAP
☆43 · Jan 30, 2024 · Updated 2 years ago
zkkli / RepQ-ViT
[ICCV 2023] RepQ-ViT: Scale Reparameterization for Post-Training Quantization of Vision Transformers
☆144 · Jan 10, 2024 · Updated 2 years ago
jakc4103 / scale-adjusted-training
PyTorch implementation of Towards Efficient Training for Neural Network Quantization
☆16 · Jan 16, 2020 · Updated 6 years ago
AI-Efficiency / Awesome-Model-Quantization
A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research; we are co…
☆2,406 · Jul 10, 2026 · Updated last week
jakc4103 / DFQ
PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction.
☆264 · Oct 3, 2023 · Updated 2 years ago
hustvl / PD-Quant
[CVPR 2023] PD-Quant: Post-Training Quantization Based on Prediction Difference Metric
☆61 · Mar 23, 2023 · Updated 3 years ago
mit-han-lab / haq
[CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision
☆408 · Feb 26, 2021 · Updated 5 years ago
albertomarchisio / SwiftTron
☆50 · Apr 8, 2023 · Updated 3 years ago
ricky40403 / DSQ
PyTorch implementation of "Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks"
☆131 · Jan 2, 2020 · Updated 6 years ago
deepglint / EasyQuant
EasyQuant (EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…
☆407 · Nov 22, 2022 · Updated 3 years ago
gilshm / sparq
Post-training sparsity-aware quantization
☆34 · Feb 26, 2023 · Updated 3 years ago
SqueezeAILab / open_source_projects
Open Source Projects from Pallas Lab
☆21 · Oct 10, 2021 · Updated 4 years ago
zkkli / PSAQ-ViT
[ECCV 2022] Patch Similarity Aware Data-Free Quantization for Vision Transformers
☆124 · Dec 22, 2022 · Updated 3 years ago
uber-research / permute-quantize-finetune
Using ideas from product quantization for state-of-the-art neural network compression.
☆146 · Aug 14, 2021 · Updated 4 years ago