KlingTeam / SVG-T2ILinks
Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".
β70Updated this week
Alternatives and similar repositories for SVG-T2I
Users that are interested in SVG-T2I are comparing it to the libraries listed below
Sorting:
- π» Uniform Discrete Diffusion with Metric Path for Video Generationβ81Updated last week
- This is the official repository for the paper "FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Compreheβ¦β111Updated 3 months ago
- β34Updated last year
- [ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"β73Updated 11 months ago
- CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151β86Updated 7 months ago
- Video Diffusion Transformers are In-Context Learnersβ35Updated 11 months ago
- Official Repo for Self-Forcing++ High Quality Long Video Generationβ209Updated 2 months ago
- [NeurIPS 2025] Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidanceβ84Updated 3 months ago
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generationβ108Updated 2 months ago
- Official PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiTβ155Updated last month
- β27Updated 3 months ago
- Lumos Project: Frontier video unified model research by Alibaba DAMO Academy.β147Updated 5 months ago
- [Neurips 2025 NextVid Workshop Oralβ¨] Official Implementation of VideoGen-of-Thought: Step-by-step generating multi-shot video with minimβ¦β53Updated 2 months ago
- β51Updated last year
- β121Updated 3 months ago
- β47Updated 7 months ago
- Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillationβ163Updated this week
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editingβ71Updated 5 months ago
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflectionβ53Updated 4 months ago
- [CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Textβ53Updated 9 months ago
- GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learningβ101Updated 6 months ago
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-projectβ184Updated 8 months ago
- [Neurips 2024] Video Diffusion Models are Training-free Motion Interpreter and Controllerβ49Updated 4 months ago
- [NeurIPS 2024] The official implement of research paper "FreeLong : Training-Free Long Video Generation with SpectralBlend Temporal Attenβ¦β64Updated 5 months ago
- β85Updated 4 months ago
- Vico: Compositional Video Generation as Flow Equalizationβ58Updated last year
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesisβ62Updated 7 months ago
- β138Updated 2 months ago
- GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Datasetβ237Updated 4 months ago
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)β85Updated 9 months ago