zai-org / SSVAE
Official implementation of the paper "Delving into Latent Spectral Biasing of Video VAEs for Superior Diffusability".
☆45Updated last month
Alternatives and similar repositories for SSVAE
Users who are interested in SSVAE are comparing it to the repositories listed below.
- ☆132Updated 7 months ago
- UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation☆18Updated 5 months ago
- ☆93Updated this week
- An official implementation of SwapAnyone.☆74Updated 10 months ago
- Official Repository of paper: "MotionEdit: Benchmarking and Learning Motion-Centric Image Editing"☆56Updated 3 weeks ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆69Updated 8 months ago
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Model (ICLR 2026)☆40Updated 6 months ago
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"☆73Updated 3 months ago
- Blending Custom Photos with Video Diffusion Transformers☆48Updated last year
- ☆31Updated 5 months ago
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search☆100Updated 4 months ago
- The official repo for "D-AR: Diffusion via Autoregressive Models"☆133Updated last week
- Make self forcing endless. Add cache purging. Add prompt controllability.☆69Updated 5 months ago
- This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation'☆37Updated last month
- Implementation of SmoothCache, a project aimed at speeding up Diffusion Transformer (DiT) based GenAI models with error-guided caching.☆47Updated 6 months ago
- Distilling Diversity and Control in Diffusion Models☆50Updated 9 months ago
- Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".☆132Updated last month
- Glance: Accelerating Diffusion Models with 1 Sample☆152Updated last month
- [ICCV'25] FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model☆82Updated 6 months ago
- Vico: Compositional Video Generation as Flow Equalization☆58Updated last year
- A unified framework for controllable caption generation across images, videos, and audio. Supports multi-modal inputs and customizable ca…☆52Updated 6 months ago
- VideoCoF: Unified Video Editing with Temporal Reasoner☆134Updated last month
- [NeurIPS 2025] IEAP: Image Editing As Programs with Diffusion Models☆112Updated 4 months ago
- Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-based Embedding Rou…☆33Updated 4 months ago
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)☆80Updated 9 months ago
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆42Updated 10 months ago
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!☆121Updated 11 months ago
- ☆34Updated 3 months ago
- RePlan: Reasoning-Guided Region Planning for Complex Instruction-Based Image Editing☆59Updated last month
- ☆34Updated 10 months ago