Official PyTorch Implementation of "Optimal Stepsize for Diffusion Sampling".
☆200 · Apr 13, 2025 · Updated 11 months ago
Alternatives and similar repositories for OptimalSteps
Users who are interested in OptimalSteps are comparing it to the repositories listed below.
- Official repo for CFG-Zero* ☆706 · May 2, 2025 · Updated 10 months ago
- Official PyTorch implementation of TokenSet. ☆128 · Mar 21, 2025 · Updated last year
- Official PyTorch implementation of the paper "Equivariant Image Modeling" (https://arxiv.org/abs/2503.18948) ☆36 · Aug 1, 2025 · Updated 7 months ago
- Official implementation of "Single Image Iterative Subject-driven Generation and Editing". ☆100 · May 30, 2025 · Updated 9 months ago
- [ICLR'2026] Scale-wise Distillation of Diffusion Models ☆118 · Mar 12, 2026 · Updated last week
- ComfyUI Custom Node for "Golden Noise for Diffusion Models: A Learning Framework". This node refines the initial latent noise in the diff… ☆23 · Mar 28, 2025 · Updated 11 months ago
- Implementing FlowEdit, maybe other inversion techniques for the Wan video generation model ☆54 · Feb 28, 2025 · Updated last year
- Vision Transformers Needs Registers. And Gated MLPs. And +20M params. Tiny modality gap ensues! ☆47 · Jun 2, 2025 · Updated 9 months ago
- Simple, Efficient, and Effective Negative Guidance in Few-Step Image Generation Models By Value Sign Flip ☆37 · Jan 27, 2026 · Updated last month
- 🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity ☆32 · May 17, 2025 · Updated 10 months ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding" ☆11 · Apr 10, 2025 · Updated 11 months ago
- ComfyUI unofficial implementation of Thera - Aliasing-Free Arbitrary-Scale Super-Resolution with Neural Heat Fields ☆36 · Jan 2, 2026 · Updated 2 months ago
- Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model ☆1,292 · Jun 8, 2025 · Updated 9 months ago
- ☆65 · May 3, 2025 · Updated 10 months ago
- https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching ☆425 · Jul 5, 2025 · Updated 8 months ago
- [ICLR 2025] You Only Sample Once: Taming One-Step Text-To-Image Synthesis by Self-Cooperative Diffusion GANs ☆71 · Mar 7, 2026 · Updated last week
- MoD Control Tile Upscaler for SDXL Pipeline ☆61 · Mar 8, 2025 · Updated last year
- Generate images from an initial frame and text ☆37 · Jul 28, 2023 · Updated 2 years ago
- Enhance-A-Video: Better Generated Video for Free ☆593 · Mar 17, 2025 · Updated last year
- A collection of custom nodes for ComfyUI. ☆28 · Jun 28, 2025 · Updated 8 months ago
- Official repository for "CFG++: manifold-constrained classifier free guidance for diffusion models" (ICLR2025) ☆238 · Mar 21, 2025 · Updated last year
- [ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention ☆646 · Mar 6, 2026 · Updated 2 weeks ago
- [Preprint] UCGM: Unified Continuous Generative Models ☆183 · May 27, 2025 · Updated 9 months ago
- ☆66 · May 22, 2025 · Updated 9 months ago
- SkyReels-A2: Compose anything in video diffusion transformers ☆706 · Jun 3, 2025 · Updated 9 months ago
- BeltOut: An open source pitch-perfect voice-to-voice timbre transfer model based on ChatterboxVC ☆79 · Jul 19, 2025 · Updated 8 months ago
- ☆239 · May 9, 2025 · Updated 10 months ago
- (CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models ☆1,269 · Aug 7, 2025 · Updated 7 months ago
- Create interpolated pose images between two images. ☆33 · Jun 3, 2025 · Updated 9 months ago
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment ☆1,498 · Sep 11, 2025 · Updated 6 months ago
- [NeurIPS 2025] Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up". ☆214 · Sep 27, 2025 · Updated 5 months ago
- [ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality ☆261 · Dec 27, 2024 · Updated last year
- ComfyUI node for modular, human‑like Kani TTS. Generate natural, high‑quality speech from text ☆38 · Oct 17, 2025 · Updated 5 months ago
- [ICCV 2025] Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers ☆475 · Dec 6, 2025 · Updated 3 months ago
- ComfyUI-Bagel is now available in ComfyUI, BAGEL is an open‑source multimodal foundation model with 7B active parameters (14B total) trai… ☆29 · May 28, 2025 · Updated 9 months ago
- Bloom image post processing effect for ComfyUI. Soft and fast Gaussian Blur bloom, box blur for speed, star pattern support. Uses GPU and… ☆66 · Jul 10, 2025 · Updated 8 months ago
- UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer ☆842 · Apr 27, 2025 · Updated 10 months ago
- A unified inference and post-training framework for accelerated video generation. ☆3,247 · Updated this week
- ComfyUI implementation of Long-CLIP ☆165 · Mar 8, 2025 · Updated last year