Kai-Liu001 / Awesome-Model-QuantizationLinks
☆19Updated last month
Alternatives and similar repositories for Awesome-Model-Quantization
Users that are interested in Awesome-Model-Quantization are comparing it to the libraries listed below
Sorting:
- ☆12Updated 3 months ago
- [TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precisio…☆45Updated 9 months ago
- PyTorch code for our paper "ARB-LLM: Alternating Refined Binarizations for Large Language Models"☆24Updated 3 months ago
- PyTorch implementation of PTQ4DiT https://arxiv.org/abs/2405.16005☆30Updated 7 months ago
- [ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization☆12Updated 7 months ago
- PyTorch code for our paper "2DQuant: Low-bit Post-Training Quantization for Image Super-Resolution"☆35Updated 8 months ago
- super-resolution; post-training quantization; model compression☆12Updated last year
- ☆10Updated 4 months ago
- ☆22Updated 3 weeks ago
- This is the official pytorch implementation for the paper: Towards Accurate Post-training Quantization for Diffusion Models.(CVPR24 Poste…☆37Updated last year
- BinaryViT: Pushing Binary Vision Transformers Towards Convolutional Models☆37Updated last year
- [CVPR 2025] APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision Transformers☆23Updated 2 months ago
- ☆13Updated 4 months ago
- QuEST: Efficient Finetuning for Low-bit Diffusion Models☆45Updated 5 months ago
- [NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer☆31Updated last year
- ptq4vm official repository☆22Updated 2 months ago
- [NeurIPS 2023 Spotlight] This project is the official implementation of our accepted NeurIPS 2023 (spotlight) paper QuantSR: Accurate Low…☆48Updated last year
- LLM Inference with Microscaling Format☆23Updated 7 months ago
- The official implementation of the NeurIPS 2022 paper Q-ViT.☆93Updated 2 years ago
- List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.☆91Updated last year
- LSQ+ or LSQplus☆69Updated 4 months ago
- [CVPR 2024 Highlight & TPAMI 2025] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for…☆65Updated last week
- [ECCV 2024] AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer☆29Updated 6 months ago
- ViTALiTy (HPCA'23) Code Repository☆23Updated 2 years ago
- The official implementation of the DAC 2024 paper GQA-LUT☆18Updated 6 months ago
- ☆10Updated last year
- DeiT implementation for Q-ViT☆25Updated 2 months ago
- ☆13Updated 3 months ago
- This is a repository of Binary General Matrix Multiply (BGEMM) by customized CUDA kernel. Thank FP6-LLM for the wheels!☆15Updated 9 months ago
- The code repository of "MBQ: Modality-Balanced Quantization for Large Vision-Language Models"☆40Updated 3 months ago