Alexstrasza98 / Transformer-QuantizationLinks

The final project repository for 2022 Spring COMS6998-009 Deep Learning System Performance in Columbia University.

☆7

Alternatives and similar repositories for Transformer-Quantization

Users that are interested in Transformer-Quantization are comparing it to the libraries listed below

Sorting:

zhexinli / Q-ViT-DeiT
DeiT implementation for Q-ViT
☆24Updated 3 months ago
hahnyuan / PTQ4ViT
Post-Training Quantization for Vision transformers.
☆223Updated 3 years ago
DravenALG / ReSTE
(ICCV 2023) Official implementation of Rectified Straight Through Estimator (ReSTE).
☆29Updated 10 months ago
nbasyl / OFQ
The official implementation of the ICML 2023 paper OFQ-ViT
☆33Updated last year
ZouJiu1 / LSQplus
LSQ+ or LSQplus
☆70Updated 6 months ago
lihuantong / HAST
☆12Updated last year
Mohamed-Imed-Eddine / Harmonic-NAS
Harmonic-NAS: Hardware-Aware Multimodal Neural Architecture Search on Resource-constrained Devices (ACML 2023)
☆14Updated last year
z-hXu / ReCU
Pytorch implementation of our paper accepted by ICCV 2021 -- ReCU: Reviving the Dead Weights in Binary Neural Networks http://arxiv.org/a…
☆39Updated 3 years ago
YanjingLi0202 / Q-ViT
The official implementation of the NeurIPS 2022 paper Q-ViT.
☆96Updated 2 years ago
hustzxd / LSQuantization
The PyTorch implementation of Learned Step size Quantization (LSQ) in ICLR2020 (unofficial)
☆137Updated 4 years ago
megvii-research / FQ-ViT
[IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer
☆347Updated 2 years ago
DD-DuDa / awesome-vit-quantization-acceleration
List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.
☆91Updated last year
yanghr / BSQ
BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)
☆41Updated 4 years ago
KwangHoonAn / PACT
Reproducing Quantization paper PACT
☆64Updated 3 years ago
GoatWu / APHQ-ViT
[CVPR 2025] APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision Transformers
☆27Updated 4 months ago
liuzechun / Nonuniform-to-Uniform-Quantization
Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022.
☆133Updated 3 years ago
xuke225 / EQ-Net
EQ-Net [ICCV 2023]
☆30Updated last year
Phuoc-Hoan-Le / BinaryViT
BinaryViT: Pushing Binary Vision Transformers Towards Convolutional Models
☆37Updated last year
HuangOwen / Quantization-Variation
[TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precisio…
☆45Updated 10 months ago
jeffreyyu0602 / quantized-training
☆29Updated last week
xidongwu / AutoTrainOnce
☆17Updated 10 months ago
ok858ok / CP-ViT
Code for "CP-ViT: Cascade Vision Transformer Pruning via Progressive Sparsity Prediction" on CIFAR-10/100.
☆14Updated 3 years ago
aojunzz / DominoSearch
☆19Updated 3 years ago
MXHX7199 / ICCV_2021_AFP
AFP is a hardware-friendly quantization framework for DNNs, which is contributed by Fangxin Liu and Wenbo Zhao.
☆13Updated 3 years ago
yhhhli / BRECQ
Pytorch implementation of BRECQ, ICLR 2021
☆282Updated 4 years ago
Cydia2018 / ViT-cifar10-pruning
Vision Transformer Pruning
☆57Updated 3 years ago
hustvl / PD-Quant
[CVPR 2023] PD-Quant: Post-Training Quantization Based on Prediction Difference Metric
☆57Updated 2 years ago
hpi-xnor / BNext
Join the High Accuracy Club on ImageNet with A Binary Neural Network Ticket
☆70Updated 2 years ago
zhutmost / lsq-net
Unofficial implementation of LSQ-Net, a neural network quantization framework
☆300Updated last year
wimh966 / QDrop
The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quan…
☆123Updated 2 years ago