adreamwu / PTQ4DiT

PyTorch implementation of PTQ4DiT https://arxiv.org/abs/2405.16005

☆31

Alternatives and similar repositories for PTQ4DiT

Users who are interested in PTQ4DiT also compare it with the repositories listed below.

thu-nics / ViDiT-Q
[ICLR'25] ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation
☆103 · Updated 3 months ago
Hsu1023 / DuQuant
[NeurIPS 2024 Oral🔥] DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs.
☆164 · Updated 9 months ago
hatchetProject / QuEST
[ICCV 2025] QuEST: Efficient Finetuning for Low-bit Diffusion Models
☆49 · Updated 3 weeks ago
BrotherHappy / OSTQuant
[ICLR2025]: OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitt…
☆67 · Updated 3 months ago
Intelligent-Computing-Lab-Panda / GPTAQ
Code implementation of GPTAQ (https://arxiv.org/abs/2504.02692)
☆51 · Updated last month
DZY122 / DiTAS
DiTAS: Quantizing Diffusion Transformers via Enhanced Activation Smoothing (WACV 2025)
☆10 · Updated 7 months ago
thu-nics / MBQ
The code repository of "MBQ: Modality-Balanced Quantization for Large Vision-Language Models"
☆46 · Updated 4 months ago
hustvl / PD-Quant
[CVPR 2023] PD-Quant: Post-Training Quantization Based on Prediction Difference Metric
☆56 · Updated 2 years ago
ModelTC / TFMQ-DM
[CVPR 2024 Highlight & TPAMI 2025] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for…
☆102 · Updated last week
ZHITENGLI / ARB-LLM
PyTorch code for our paper "ARB-LLM: Alternating Refined Binarizations for Large Language Models"
☆25 · Updated 3 months ago
DD-DuDa / awesome-vit-quantization-acceleration
List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.
☆92 · Updated last year
Aaronhuang-778 / Mixture-Compressor-MoE
[ICLR 2025] Mixture Compressor for Mixture-of-Experts LLMs Gains More
☆47 · Updated 5 months ago
ChenMnZ / PrefixQuant
An algorithm for weight-activation quantization (W4A4, W4A8) of LLMs, supporting both static and dynamic quantization
☆142 · Updated last month
GoatWu / APHQ-ViT
[CVPR 2025] APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision Transformers
☆23 · Updated 3 months ago
Juanerx / Q-DiT
[CVPR 2025] Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers
☆53 · Updated 10 months ago
JingyangXiang / DFRot
[COLM 2025] DFRot: Achieving Outlier-Free and Massive Activation-Free for Rotated LLMs with Refined Rotation; Zhihu: https://zhuanlan.zhihu.c…
☆24 · Updated 4 months ago
JunyiWuCode / QuantCache
☆12 · Updated last week
wimh966 / QDrop
The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quan…
☆122 · Updated 2 years ago
42Shawn / PTQ4DM
Implementation of Post-training Quantization on Diffusion Models (CVPR 2023)
☆139 · Updated 2 years ago
thu-nics / qllm-eval
Code Repository of Evaluating Quantized Large Language Models
☆130 · Updated 10 months ago
yifu-ding / BGEMM-CUDA
This is a repository of Binary General Matrix Multiply (BGEMM) by customized CUDA kernel. Thank FP6-LLM for the wheels!
☆15 · Updated 10 months ago
A-suozhang / MixDQ
[ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization
☆14 · Updated 7 months ago
GoatWu / AdaLog
[ECCV 2024] AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer
☆29 · Updated 7 months ago
ThisisBillhe / EfficientDM
[ICLR 2024 Spotlight] This is the official PyTorch implementation of "EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Di…
☆62 · Updated last year
mit-han-lab / Block-Sparse-Attention
A sparse attention kernel supporting mix sparse patterns
☆256 · Updated 5 months ago
Intelligent-Computing-Lab-Panda / TesseraQ
☆22 · Updated 8 months ago
Efficient-ML / Awesome-Efficient-AIGC
A list of papers, docs, codes about efficient AIGC. This repo is aimed to provide the info for efficient AIGC research, including languag…
☆186 · Updated 5 months ago
ruikangliu / FlatQuant
[ICML 2025] Official PyTorch implementation of "FlatQuant: Flatness Matters for LLM Quantization"
☆141 · Updated last month
hahnyuan / PTQ4ViT
Post-Training Quantization for Vision transformers.
☆221 · Updated 2 years ago
thu-ml / Jetfire-INT8Training
☆52 · Updated 11 months ago