Official PyTorch Implementation of "Optimal Stepsize for Diffusion Sampling".
☆197Apr 13, 2025Updated 10 months ago
Alternatives and similar repositories for OptimalSteps
Users that are interested in OptimalSteps are comparing it to the libraries listed below
Sorting:
- Official repo for CFG-Zero*☆704May 2, 2025Updated 9 months ago
- Simple, Efficient, and Effective Negative Guidance in Few-Step Image Generation Models By Value Sign Flip☆37Jan 27, 2026Updated last month
- Official implementation of "Single Image Iterative Subject-driven Generation and Editing".☆100May 30, 2025Updated 9 months ago
- [ICLR'2026] Scale-wise Distillation of Diffusion Models☆113Sep 17, 2025Updated 5 months ago
- Implementing FlowEdit, maybe other inversion techniques for the Wan video generation model☆54Feb 28, 2025Updated last year
- [ICLR 2025] You Only Sample Once: Taming One-Step Text-To-Image Synthesis by Self-Cooperative Diffusion GANs☆71Mar 11, 2025Updated 11 months ago
- Generate images from an initial frame and text☆37Jul 28, 2023Updated 2 years ago
- ComfyUI unofficial implementation of Thera - Aliasing-Free Arbitrary-Scale Super-Resolution with Neural Heat Fields☆36Jan 2, 2026Updated last month
- Official PyTorch implementation of TokenSet.☆128Mar 21, 2025Updated 11 months ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- ☆235May 9, 2025Updated 9 months ago
- [NeurIPS 2025] Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆215Sep 27, 2025Updated 5 months ago
- ☆33Aug 9, 2024Updated last year
- ☆64May 3, 2025Updated 9 months ago
- SkyReels-A2: Compose anything in video diffusion transformers☆701Jun 3, 2025Updated 8 months ago
- ComfyUI-Bagel is now available in ComfyUI, BAGEL is an open‑source multimodal foundation model with 7B active parameters (14B total) trai…☆29May 28, 2025Updated 9 months ago
- ComfyUI node for modular, human‑like Kani TTS. Generate natural, high‑quality speech from text☆38Oct 17, 2025Updated 4 months ago
- Official repository for "CFG++: manifold-constrained classifier free guidance for diffusion models" (ICLR2025)☆237Mar 21, 2025Updated 11 months ago
- ComfyUI Custom Node for "Golden Noise for Diffusion Models: A Learning Framework". This node refines the initial latent noise in the diff…☆23Mar 28, 2025Updated 11 months ago
- UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer☆834Apr 27, 2025Updated 10 months ago
- Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model☆1,258Jun 8, 2025Updated 8 months ago
- [NeurIPS 2025] IEAP: Image Editing As Programs with Diffusion Models☆113Sep 27, 2025Updated 5 months ago
- https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching☆422Jul 5, 2025Updated 7 months ago
- MoD Control Tile Upscaler for SDXL Pipeline☆61Mar 8, 2025Updated 11 months ago
- Enhance-A-Video: Better Generated Video for Free☆594Mar 17, 2025Updated 11 months ago
- [CVPR 2025] Official implementation of the paper "Generative Inbetweening through Frame-wise Conditions-Driven Video Generation"☆115Feb 27, 2025Updated last year
- PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation☆37Oct 28, 2024Updated last year
- Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control☆191Dec 31, 2024Updated last year
- [ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention☆627Feb 3, 2026Updated 3 weeks ago
- ☆66May 22, 2025Updated 9 months ago
- [CVPR 2026] Training-free Mixed-Resolution Latent Upsampling for Spatially Accelerated Diffusion Transformers☆54Updated this week
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment☆1,481Sep 11, 2025Updated 5 months ago
- [ICCV 2025] Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers☆454Dec 6, 2025Updated 2 months ago
- [ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.☆946Dec 31, 2025Updated 2 months ago
- ☆15Jun 1, 2025Updated 8 months ago
- Unofficial implementation of Face0 with SDXL☆12Sep 1, 2023Updated 2 years ago
- Vision Transformers Needs Registers. And Gated MLPs. And +20M params. Tiny modality gap ensues!☆47Jun 2, 2025Updated 8 months ago
- [ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality☆259Dec 27, 2024Updated last year
- ☆119May 13, 2025Updated 9 months ago