enyac-group / evol-qLinks

Quantization in the Jagged Loss Landscape of Vision Transformers

☆13

Alternatives and similar repositories for evol-q

Users that are interested in evol-q are comparing it to the libraries listed below

Sorting:

HuangOwen / Quantization-Variation
[TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precisio…
☆45Updated 9 months ago
zhexinli / Q-ViT-DeiT
DeiT implementation for Q-ViT
☆25Updated 2 months ago
tinganchen / AlignQ
[CVPR 2022] AlignQ: Alignment Quantization with ADMM-based Correlation Preservation
☆11Updated 2 years ago
Qualcomm-AI-research / oscillations-qat
☆76Updated 2 years ago
wimh966 / outlier_suppression
The official PyTorch implementation of the NeurIPS2022 (spotlight) paper, Outlier Suppression: Pushing the Limit of Low-bit Transformer L…
☆47Updated 2 years ago
z-hXu / ReCU
Pytorch implementation of our paper accepted by ICCV 2021 -- ReCU: Reviving the Dead Weights in Binary Neural Networks http://arxiv.org/a…
☆39Updated 3 years ago
huqinghao / PalQuant
☆12Updated 2 years ago
Qualcomm-AI-research / BayesianBits
☆20Updated 3 years ago
yanghr / BSQ
BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)
☆40Updated 4 years ago
ModelTC / Outlier_Suppression_Plus
Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…
☆46Updated last year
aojunzz / DominoSearch
☆19Updated 3 years ago
wimh966 / QDrop
The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quan…
☆122Updated 2 years ago
kriskrisliu / NoisyQuant
An official implement of CVPR 2023 paper - NoisyQuant: Noisy Bias-Enhanced Post-Training Activation Quantization for Vision Transformers
☆21Updated last year
hustvl / PD-Quant
[CVPR 2023] PD-Quant: Post-Training Quantization Based on Prediction Difference Metric
☆56Updated 2 years ago
snap-research / F8Net
[ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization
☆95Updated 3 years ago
YanjingLi0202 / Q-ViT
The official implementation of the NeurIPS 2022 paper Q-ViT.
☆96Updated 2 years ago
parsa-epfl / quantization-sparsity-interplay
This repo contains the code for studying the interplay between quantization and sparsity methods
☆21Updated 4 months ago
nbasyl / OFQ
The official implementation of the ICML 2023 paper OFQ-ViT
☆32Updated last year
jmluu / Awesome-Efficient-Training
A collection of research papers on efficient training of DNNs
☆70Updated 3 years ago
liuzechun / Nonuniform-to-Uniform-Quantization
Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022.
☆133Updated 3 years ago
hustzxd / LSQuantization
The PyTorch implementation of Learned Step size Quantization (LSQ) in ICLR2020 (unofficial)
☆135Updated 4 years ago
gilshm / sparq
Post-training sparsity-aware quantization
☆34Updated 2 years ago
Qualcomm-AI-research / outlier-free-transformers
☆42Updated last year
yaozhewei / HAP
☆43Updated last year
Qualcomm-AI-research / FP8-quantization
☆153Updated 2 years ago
ZouJiu1 / LSQplus
LSQ+ or LSQplus
☆69Updated 5 months ago
hahnyuan / PTQ4ViT
Post-Training Quantization for Vision transformers.
☆221Updated 2 years ago
xvyaward / owq
Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Model…
☆63Updated last year
deJQK / FracBits
Neural Network Quantization With Fractional Bit-widths
☆12Updated 4 years ago
htqin / BiBench
[ICML 2023] This project is the official implementation of our accepted ICML 2023 paper BiBench: Benchmarking and Analyzing Network Binar…
☆56Updated last year