nbasyl/OFQ

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/nbasyl/OFQ)

nbasyl / OFQ

The official implementation of the ICML 2023 paper OFQ-ViT

☆39

Alternatives and similar repositories for OFQ

Users that are interested in OFQ are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

facebookresearch / Ternary_Binary_Transformer
View on GitHub
ACL 2023
☆39Jun 6, 2023Updated 3 years ago
HuangOwen / QAT-ACS
View on GitHub
[TMLR] Official PyTorch implementation of paper "Efficient Quantization-aware Training with Adaptive Coreset Selection"
☆38Aug 20, 2024Updated last year
HuangOwen / Quantization-Variation
View on GitHub
[TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precisio…
☆49Sep 27, 2024Updated last year
Qualcomm-AI-research / oscillations-qat
View on GitHub
☆81Jul 21, 2022Updated 3 years ago
hustvl / PD-Quant
View on GitHub
[CVPR 2023] PD-Quant: Post-Training Quantization Based on Prediction Difference Metric
☆61Mar 23, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
liuzechun / Nonuniform-to-Uniform-Quantization
View on GitHub
Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022.
☆139Apr 28, 2022Updated 4 years ago
hahnyuan / PTQ4ViT
View on GitHub
Post-Training Quantization for Vision transformers.
☆242Jul 19, 2022Updated 4 years ago
YanjingLi0202 / Q-ViT
View on GitHub
The official implementation of the NeurIPS 2022 paper Q-ViT.
☆105May 22, 2023Updated 3 years ago
PingchengDong / GQA-LUT
View on GitHub
The official implementation of the DAC 2024 paper GQA-LUT
☆24Dec 20, 2024Updated last year
facebookresearch / bit
View on GitHub
Code repo for the paper BiT Robustly Binarized Multi-distilled Transformer
☆115Jun 26, 2023Updated 3 years ago
moranshkolnik / RobustQuantization
View on GitHub
source code of the paper: Robust Quantization: One Model to Rule Them All
☆42Mar 24, 2023Updated 3 years ago
DravenALG / ReSTE
View on GitHub
(ICCV 2023) Official implementation of Rectified Straight Through Estimator (ReSTE).
☆34Sep 20, 2024Updated last year
kriskrisliu / NoisyQuant
View on GitHub
An official implement of CVPR 2023 paper - NoisyQuant: Noisy Bias-Enhanced Post-Training Activation Quantization for Vision Transformers
☆27Mar 13, 2024Updated 2 years ago
xvyaward / owq
View on GitHub
Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Model…
☆72Mar 7, 2024Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
wimh966 / outlier_suppression
View on GitHub
The official PyTorch implementation of the NeurIPS2022 (spotlight) paper, Outlier Suppression: Pushing the Limit of Low-bit Transformer L…
☆49Oct 5, 2022Updated 3 years ago
ThisisBillhe / BiViT
View on GitHub
The official implementation of BiViT: Extremely Compressed Binary Vision Transformers
☆16Jun 18, 2023Updated 3 years ago
ModelTC / Outlier_Suppression_Plus
View on GitHub
Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…
☆52Oct 21, 2023Updated 2 years ago
zkkli / I-ViT
View on GitHub
[ICCV 2023] I-ViT: Integer-only Quantization for Efficient Vision Transformer Inference
☆207Sep 2, 2024Updated last year
megvii-research / FQ-ViT
View on GitHub
[IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer
☆359Apr 11, 2023Updated 3 years ago
zhutmost / lsq-net
View on GitHub
Unofficial implementation of LSQ-Net, a neural network quantization framework
☆315May 8, 2024Updated 2 years ago
wujx2001 / QwT
View on GitHub
Official PyTorch implementation of QwT—“Quantization without Tears” (CVPR 2025): fast, accurate, and hassle-free post-training network qu…
☆32Sep 30, 2025Updated 9 months ago
IST-DASLab / gemm-fp8
View on GitHub
High Performance FP8 GEMM Kernels for SM89 and later GPUs.
☆21Jan 24, 2025Updated last year
wimh966 / QDrop
View on GitHub
The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quan…
☆131Sep 23, 2025Updated 9 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
nbasyl / LLM-FP4
View on GitHub
The official implementation of the EMNLP 2023 paper LLM-FP4
☆224Dec 15, 2023Updated 2 years ago
facebookresearch / LLM-QAT
View on GitHub
Code repo for the paper "LLM-QAT Data-Free Quantization Aware Training for Large Language Models"
☆325Mar 4, 2025Updated last year
GATECH-EIC / ShiftAddViT
View on GitHub
[NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer
☆30Dec 6, 2023Updated 2 years ago
SteveTsui / Q-DETR
View on GitHub
☆38Sep 3, 2023Updated 2 years ago
plumerai / rethinking-bnn-optimization
View on GitHub
Implementation for the paper "Latent Weights Do Not Exist: Rethinking Binarized Neural Network Optimization"
☆76Dec 8, 2019Updated 6 years ago
Qualcomm-AI-research / transformer-quantization
View on GitHub
☆211Nov 9, 2021Updated 4 years ago
cjf00000 / StatQuant
View on GitHub
code for the paper "A Statistical Framework for Low-bitwidth Training of Deep Neural Networks"
☆29Oct 31, 2020Updated 5 years ago
AI-Efficiency / BiBench
View on GitHub
[ICML 2023] This project is the official implementation of our accepted ICML 2023 paper BiBench: Benchmarking and Analyzing Network Binar…
☆56Mar 4, 2024Updated 2 years ago
1157942086 / CVPR2020_Auxiliary_Quantization
View on GitHub
Training Quantized Neural Networks with a Full-precision Auxiliary Module
☆13Jun 19, 2020Updated 6 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
hahnyuan / PB-LLM
View on GitHub
PB-LLM: Partially Binarized Large Language Models
☆158Nov 20, 2023Updated 2 years ago
ziplab / QTool
View on GitHub
Collections of model quantization algorithms. Any issues, please contact Peng Chen (blueardour@gmail.com)
☆73Oct 7, 2021Updated 4 years ago
HuangOwen / RoLoRA
View on GitHub
[EMNLP 2024] RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization
☆40Sep 24, 2024Updated last year
pprp / STBLLM
View on GitHub
[ICLR25] STBLLM: Breaking the 1-Bit Barrier with Structured Binary LLMs
☆20Jun 3, 2025Updated last year
Qualcomm-AI-research / lr-qat
View on GitHub
☆54Nov 5, 2024Updated last year
csyhhu / MetaQuant
View on GitHub
Codes for Accepted Paper : "MetaQuant: Learning to Quantize by Learning to Penetrate Non-differentiable Quantization" in NeurIPS 2019
☆54May 8, 2020Updated 6 years ago
hustzxd / LSQuantization
View on GitHub
The PyTorch implementation of Learned Step size Quantization (LSQ) in ICLR2020 (unofficial)
☆139Nov 19, 2020Updated 5 years ago