TaylorJocelyn/Diffusion-Model-Quantization

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TaylorJocelyn/Diffusion-Model-Quantization)

TaylorJocelyn / Diffusion-Model-Quantization

☆64

Alternatives and similar repositories for Diffusion-Model-Quantization

Users that are interested in Diffusion-Model-Quantization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Bujiazi / DiCache
View on GitHub
[ICLR 2026] Official implementation of DiCache: Let Diffusion Model Determine Its Own Cache
☆61Jan 26, 2026Updated 5 months ago
wlfeng0509 / Awesome-Diffusion-Quantization
View on GitHub
A list of papers, docs, codes about diffusion quantization.This repo collects various quantization methods for the Diffusion Models. Welc…
☆20Feb 2, 2026Updated 5 months ago
ugonfor / DGQ
View on GitHub
[ICLR 2025] DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models
☆19Mar 25, 2025Updated last year
bytedance / ERTACache
View on GitHub
☆24Sep 4, 2025Updated 10 months ago
mingzeG / DropCov
View on GitHub
Implementation of DropCov as described in DropCov: A Simple yet Effective Method for Improving Deep Architectures
☆10Oct 15, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
xlite-dev / Awesome-DiT-Inference
View on GitHub
📚A curated list of Awesome Diffusion Inference Papers with Codes: Sampling, Cache, Quantization, Parallelism, etc.🎉
☆578Jun 13, 2026Updated last month
lliai / Awesome-Efficient-Diffusion-Models
View on GitHub
Paper survey of efficient computation for large scale models.
☆34Dec 7, 2024Updated last year
adreamwu / PTQ4DiT
View on GitHub
PyTorch implementation of PTQ4DiT https://arxiv.org/abs/2405.16005
☆49Nov 8, 2024Updated last year
thu-nics / ViDiT-Q
View on GitHub
[ICLR'25] ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation
☆163Mar 21, 2025Updated last year
ModelTC / TFMQ-DM
View on GitHub
[CVPR 2024 Highlight & TPAMI 2025] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for…
☆110Sep 29, 2025Updated 9 months ago
Tammytcl / Awesome-Diffusion-Acceleration-Cache
View on GitHub
A curated list of research papers, resources, and advancements on Diffusion Cache and related efficient diffusion model acceleration tech…
☆86Nov 4, 2025Updated 8 months ago
lhxcs / DVD-Quant
View on GitHub
☆17Oct 5, 2025Updated 9 months ago
Tencent-Hunyuan / Tencent-Hunyuan-7B-0124
View on GitHub
☆29Aug 21, 2025Updated 11 months ago
ezyang / ai-blindspots
View on GitHub
Blindspots in LLMs I've noticed while AI coding. Sonnet family emphasis.
☆13Mar 20, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
Kai-Liu001 / Awesome-One-Step-Diffusion
View on GitHub
☆16May 20, 2025Updated last year
lmbxmu / CutDiffusion
View on GitHub
CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method
☆27Oct 9, 2025Updated 9 months ago
hatchetProject / QuEST
View on GitHub
[ICCV 2025] QuEST: Efficient Finetuning for Low-bit Diffusion Models
☆60Jun 26, 2025Updated last year
Xingyu-Zheng / BiDM
View on GitHub
(NeurIPS 2024) BiDM: Pushing the Limit of Quantization for Diffusion Models
☆22Updated this week
NoakLiu / FastCache-xDiT
View on GitHub
FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear Approximation [Efficient ML Model]
☆52Apr 29, 2026Updated 2 months ago
kemingy / rabitq
View on GitHub
rabitq rust implementation
☆11May 14, 2026Updated 2 months ago
Shenyi-Z / TaylorSeer
View on GitHub
[ICCV2025] From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers
☆406Mar 2, 2026Updated 4 months ago
1157942086 / CVPR2020_Auxiliary_Quantization
View on GitHub
Training Quantized Neural Networks with a Full-precision Auxiliary Module
☆13Jun 19, 2020Updated 6 years ago
AI-Infra-Team / awesome-papers
View on GitHub
Paper reading and discussion notes, covering AI frameworks, distributed systems, cluster management, etc.
☆69Mar 4, 2026Updated 4 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ThisisBillhe / ZipAR
View on GitHub
[ICML 2025] This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality…
☆51Mar 25, 2025Updated last year
csguoh / IntLoRA
View on GitHub
[ICML2025] LoRA fine-tune directly on the INT4 models.
☆41Nov 25, 2024Updated last year
ExplainableML / HyperNoise
View on GitHub
☆70Dec 5, 2025Updated 7 months ago
chengzeyi / ParaAttention
View on GitHub
https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching
☆427Jul 5, 2025Updated last year
real-hjq / MS-SWD
View on GitHub
[ECCV 2024] Multiscale Sliced Wasserstein Distances as Perceptual Color Difference Measures
☆35Updated this week
CFinTech / SparseSSM
View on GitHub
[arxiv 2025] SparseSSM: Efficient Selective Structured State Space Models Can Be Pruned in One-Shot
☆21Oct 8, 2025Updated 9 months ago
azuresky03 / distill_wan2.1
View on GitHub
☆26May 30, 2025Updated last year
thu-nics / DiTFastAttn
View on GitHub
☆192Jan 14, 2025Updated last year
thu-nics / PM-KVQ
View on GitHub
The official code implementation for paper "PM-KVQ: Progressive Mixed-precision KV Cache Quantization for Long-CoT LLMs"
☆29May 24, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
rohitgandikota / distillation
View on GitHub
Distilling Diversity and Control in Diffusion Models
☆53Apr 28, 2025Updated last year
mi150 / VaLoRA
View on GitHub
☆11May 19, 2025Updated last year
yikuizhai / DGMA2-Net
View on GitHub
Implementation of DGMA2-Net: A Difference-Guided Mutiscale Aggregation Attention Network for Remote Sensing Image Change Detection
☆17Sep 17, 2024Updated last year
edward3862 / Analogist
View on GitHub
Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024)
☆38Sep 10, 2024Updated last year
mitkotak / fast_flops
View on GitHub
FLOPS counter for all your GPU benchmarking needs
☆13Aug 8, 2024Updated last year
Zehong-Ma / MagCache
View on GitHub
The official code for NeurIPS 2025 "MagCache: Fast Video Generation with Magnitude-Aware Cache"
☆275Nov 17, 2025Updated 8 months ago
luping-liu / Detector-Guidance
View on GitHub
The official implementation for Detector Guidance for Multi-Object Text-to-Image Generation (DG)
☆20Feb 7, 2024Updated 2 years ago