mit-han-lab / nunchaku
SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
☆246Updated this week
Related projects ⓘ
Alternatives and complementary repositories for nunchaku
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer☆324Updated 3 weeks ago
- [NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising☆161Updated last month
- I'm back! Implementations of Meissonic developed by Community☆207Updated this week
- Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"☆238Updated last month
- TerDiT: Ternary Diffusion Models with Transformers☆61Updated 4 months ago
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!☆359Updated 2 months ago
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).☆259Updated this week
- Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x fast…☆204Updated last month
- ☆122Updated last month
- Model Compression Toolbox for Large Language Models and Diffusion Models☆188Updated this week
- Video-Infinity generates long videos quickly using multiple GPUs without extra training.☆163Updated 3 months ago
- ☆97Updated last month
- FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality☆136Updated this week
- Official Implementation of weights2weights☆121Updated last month
- faster parallel inference of mochi video generation model☆53Updated this week
- [ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization☆30Updated 2 months ago
- Memory optimized finetuning scripts for CogVideoX using TorchAO and DeepSpeed☆378Updated this week
- [ICCV 2023] Q-Diffusion: Quantizing Diffusion Models.☆327Updated 7 months ago
- [CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models☆588Updated last week
- ☆42Updated 8 months ago
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching☆71Updated 3 months ago
- Faster generation with text-to-image diffusion models.☆193Updated last month
- Adaptive Caching for Faster Video Generation with Diffusion Transformers☆79Updated last week
- Repo is required for the code of our research paper on micro-budget training of large scale diffusion model.☆153Updated 3 months ago
- Official Repository of the paper "Trajectory Consistency Distillation"☆318Updated 6 months ago
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini…☆495Updated 2 months ago
- Patch convolution to avoid large GPU memory usage of Conv2D☆79Updated 5 months ago
- IP Adapter Instruct☆182Updated 3 months ago
- text to image to generation: CogView3-Plus and CogView3(ECCV 2024)☆243Updated 3 weeks ago
- Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models arXiv 2023 / CVPR 2024☆317Updated last month