YoungHyun197 / ptq4vmLinks

ptq4vm official repository

☆22

Alternatives and similar repositories for ptq4vm

Users that are interested in ptq4vm are comparing it to the libraries listed below

Sorting:

Qualcomm-AI-research / outlier-free-transformers
☆45Updated last year
DD-DuDa / awesome-vit-quantization-acceleration
List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.
☆95Updated last year
HuangOwen / Quantization-Variation
[TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precisio…
☆46Updated last year
YanjingLi0202 / Q-ViT
The official implementation of the NeurIPS 2022 paper Q-ViT.
☆98Updated 2 years ago
Phuoc-Hoan-Le / BinaryViT
BinaryViT: Pushing Binary Vision Transformers Towards Convolutional Models
☆37Updated last year
Kai-Liu001 / Awesome-Model-Quantization
This repository contains low-bit quantization papers from 2020 to 2025 on top conference.
☆61Updated last month
htqin / BiBench
[ICML 2023] This project is the official implementation of our accepted ICML 2023 paper BiBench: Benchmarking and Analyzing Network Binar…
☆55Updated last year
GATECH-EIC / ViTALiTy
ViTALiTy (HPCA'23) Code Repository
☆23Updated 2 years ago
ZouJiu1 / LSQplus
LSQ+ or LSQplus
☆73Updated 8 months ago
hustvl / PD-Quant
[CVPR 2023] PD-Quant: Post-Training Quantization Based on Prediction Difference Metric
☆59Updated 2 years ago
hahnyuan / PTQ4ViT
Post-Training Quantization for Vision transformers.
☆228Updated 3 years ago
facebookresearch / bit
Code repo for the paper BiT Robustly Binarized Multi-distilled Transformer
☆112Updated 2 years ago
liuzechun / Nonuniform-to-Uniform-Quantization
Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022.
☆134Updated 3 years ago
iamkanghyunchoi / ait
It's All In the Teacher: Zero-Shot Quantization Brought Closer to the Teacher [CVPR 2022 Oral]
☆29Updated 3 years ago
parsa-epfl / quantization-sparsity-interplay
This repo contains the code for studying the interplay between quantization and sparsity methods
☆23Updated 8 months ago
naver-aics / lut-gemm
☆79Updated last year
GATECH-EIC / ViTCoD
[HPCA 2023] ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design
☆122Updated 2 years ago
HuangOwen / RoLoRA
[EMNLP 2024] RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization
☆38Updated last year
adreamwu / PTQ4DiT
PyTorch implementation of PTQ4DiT https://arxiv.org/abs/2405.16005
☆37Updated 11 months ago
xvyaward / owq
Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Model…
☆66Updated last year
Intelligent-Computing-Lab-Panda / TesseraQ
☆23Updated 11 months ago
A-suozhang / MixDQ
[ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization
☆14Updated 11 months ago
jmluu / Awesome-Efficient-Training
A collection of research papers on efficient training of DNNs
☆69Updated 3 years ago
aiha-lab / MX-QLLM
LLM Inference with Microscaling Format
☆31Updated 11 months ago
zhexinli / Q-ViT-DeiT
DeiT implementation for Q-ViT
☆25Updated 6 months ago
hatchetProject / QuEST
[ICCV 2025] QuEST: Efficient Finetuning for Low-bit Diffusion Models
☆53Updated 4 months ago
thu-ml / Jetfire-INT8Training
☆58Updated last year
Qualcomm-AI-research / FP8-quantization
☆163Updated 2 years ago
GATECH-EIC / ShiftAddViT
[NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer
☆30Updated last year
wimh966 / QDrop
The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quan…
☆124Updated last month