Xingyu-Zheng / BiDMLinks
(NeurIPS 2024) BiDM: Pushing the Limit of Quantization for Diffusion Models
☆21Updated 7 months ago
Alternatives and similar repositories for BiDM
Users that are interested in BiDM are comparing it to the libraries listed below
Sorting:
- (ICLR 2025) BinaryDM: Accurate Weight Binarization for Efficient Diffusion Models☆21Updated 9 months ago
- ☆14Updated 3 months ago
- [CVPR 2025] Efficient Personalization of Quantized Diffusion Model without Backpropagation☆14Updated 3 months ago
- [ECCV 2024] Official pytorch implementation of "Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts"☆43Updated last year
- TerDiT: Ternary Diffusion Models with Transformers☆71Updated last year
- VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆47Updated this week
- The official implementation of our paper "CoRe^2: Collect, Reflect and Refine to Generate Better and Faster".☆23Updated 3 months ago
- The official repo of continuous speculative decoding☆27Updated 3 months ago
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching☆107Updated last year
- VidKV: Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models☆21Updated 3 months ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆27Updated last year
- [ICML2025] LoRA fine-tune directly on the quantized models.☆31Updated 7 months ago
- [ICLR 2024 Spotlight] This is the official PyTorch implementation of "EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Di…☆62Updated last year
- ☆70Updated 7 months ago
- [CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin…☆30Updated 2 months ago
- WeGeFT: Weight‑Generative Fine‑Tuning for Multi‑Faceted Efficient Adaptation of Large Models☆20Updated last week
- DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling☆33Updated last month
- ScaleKV: Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression☆46Updated last month
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆38Updated 4 months ago
- [CVPR2025] Breaking the Low-Rank Dilemma of Linear Attention☆25Updated 4 months ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024]☆81Updated 8 months ago
- [Preprint] Efficient Generative Model Training via Embedded Representation Warmup☆30Updated 3 months ago
- (ToCa-v2) A New version of ToCa,with faster speed and better acceleration!☆37Updated 4 months ago
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…☆30Updated 8 months ago
- [ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.☆11Updated 5 months ago
- Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better☆31Updated last month
- [CVPR 2025 Highlight] TinyFusion: Diffusion Transformers Learned Shallow☆130Updated 3 months ago
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆102Updated 3 months ago
- [ICML 2025] This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality…☆50Updated 3 months ago
- This repository includes the official implementation of our paper "Grouping First, Attending Smartly: Training-Free Acceleration for Diff…☆52Updated last month