KlingTeam / SVG-T2ILinks
Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".
☆118Updated 3 weeks ago
Alternatives and similar repositories for SVG-T2I
Users that are interested in SVG-T2I are comparing it to the libraries listed below
Sorting:
- DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder☆172Updated 3 months ago
- This is the project for 'Any2Caption', Interpreting Any Condition to Caption for Controllable Video Generation☆49Updated 9 months ago
- VideoCoF: Unified Video Editing with Temporal Reasoner☆122Updated this week
- CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151☆86Updated 7 months ago
- GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset☆241Updated 4 months ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆69Updated 7 months ago
- Make self forcing endless. Add cache purging. Add prompt controllability.☆68Updated 3 months ago
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generation☆109Updated 3 months ago
- Official Repo for Self-Forcing++ High Quality Long Video Generation☆216Updated 2 months ago
- This is the official implementation of SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation.☆114Updated last year
- ☆57Updated 2 months ago
- [ICCV 2025] Code for FreeScale, a tuning-free method for higher-resolution visual generation☆147Updated 2 months ago
- Vision Bridge Transformer at Scale☆133Updated last month
- Official Implementations for Paper - MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues☆106Updated last month
- [NeurIPS 2025] Training-Free Efficient Video Generation via Dynamic Token Carving☆261Updated 5 months ago
- ☆29Updated 9 months ago
- [NeurIPS 2025] Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance☆84Updated 3 months ago
- This is the official repository for the paper "FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehe…☆112Updated 3 months ago
- Official Implementation of "MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives"☆150Updated last week
- 🐻 Uniform Discrete Diffusion with Metric Path for Video Generation☆87Updated 2 weeks ago
- ☆132Updated 6 months ago
- Implementation Code for Omni-Effects☆163Updated 3 weeks ago
- Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback☆204Updated 3 weeks ago
- ☆47Updated 8 months ago
- The official implementation of ”RepVideo: Rethinking Cross-Layer Representation for Video Generation“☆123Updated 11 months ago
- [SIGGRAPH ASIA'25] BlobCtrl: Taming Controllable Blob for Element-level Image Editing☆25Updated last month
- [[NeurIPS 2025] UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions☆78Updated 5 months ago
- ☆52Updated 3 weeks ago
- 4-steps distilled version of Wan2.2-TI2V-5B☆129Updated 3 months ago
- [ICCV 2025] Balanced Image Stylization with Style Matching Score☆66Updated 3 months ago