KlingTeam / SVG-T2ILinks

Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".

☆132

Alternatives and similar repositories for SVG-T2I

Users that are interested in SVG-T2I are comparing it to the libraries listed below

Sorting:

knightyxp / VideoCoF
VideoCoF: Unified Video Editing with Temporal Reasoner
☆138Updated last month
rongyaofang / prism-bench
This is the official repository for the paper "FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehe…
☆121Updated 2 weeks ago
desaixie / pa_vdm
CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151
☆90Updated 9 months ago
wyhlovecpp / GPT-Image-Edit
GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset
☆244Updated 5 months ago
KlingTeam / VANS
Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO
☆92Updated 2 months ago
Gen-Verse / Diffusion-Sharpening
Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening
☆69Updated 8 months ago
ChocoWu / Any2Caption
This is the project for 'Any2Caption', Interpreting Any Condition to Caption for Controllable Video Generation
☆49Updated 10 months ago
arthur-qiu / FreeTraj
Code for FreeTraj, a tuning-free method for trajectory-controllable video generation
☆111Updated 4 months ago
dc-ai-projects / DC-VideoGen
DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder
☆179Updated 4 months ago
TencentARC / BlobCtrl
[SIGGRAPH ASIA'25] BlobCtrl: Taming Controllable Blob for Element-level Image Editing
☆26Updated 2 months ago
showlab / SMS
[ICCV 2025] Balanced Image Stylization with Style Matching Score
☆67Updated 4 months ago
baaivision / URSA
[ICLR 2026] 🐻 Uniform Discrete Diffusion with Metric Path for Video Generation
☆102Updated this week
KaiyueSun98 / T2I-Personalization-with-AR
☆47Updated 9 months ago
YujiaHu1109 / IEAP
[NeurIPS 2025] IEAP: Image Editing As Programs with Diffusion Models
☆112Updated 4 months ago
Dere-Wah / Self-Forcing-Endless
Make self forcing endless. Add cache purging. Add prompt controllability.
☆69Updated 5 months ago
EnVision-Research / OmniBooth
☆133Updated 10 months ago
justincui03 / Self-Forcing-Plus-Plus
Official Repo for Self-Forcing++ High Quality Long Video Generation
☆233Updated 4 months ago
Bujiazi / HiFlow
[NeurIPS 2025] Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance
☆84Updated 4 months ago
AMAP-ML / S2-Guidance
[ICLR2026] Implementation of "S^2-Guidance: Stochastic Self Guidance for Training-Free Enhancement of Diffusion Models"
☆152Updated last week
ali-vilab / FreeScale
[ICCV 2025] Code for FreeScale, a tuning-free method for higher-resolution visual generation
☆148Updated 4 months ago
alibaba-damo-academy / Lumos
[ICLR 2026] Lumos Project: Frontier video unified model research by Alibaba DAMO Academy.
☆152Updated 2 weeks ago
poppuppy / SAR
☆35Updated last month
KlingTeam / MemFlow
Official Implementation of "MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives"
☆185Updated last month
quanhaol / Wan2.2-TI2V-5B-Turbo
4-steps distilled version of Wan2.2-TI2V-5B
☆137Updated 2 weeks ago
Bujiazi / ByTheWay
[CVPR 2025] Official implementation of ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way
☆46Updated 4 months ago
KAIST-Visual-AI-Group / Flow-Inference-Time-Scaling
[NeurIPS 2025] Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing
☆72Updated 4 months ago
Yuanshi9815 / ViBT
Vision Bridge Transformer at Scale
☆138Updated 2 months ago
TIGER-AI-Lab / EditReward
EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing [ICLR 2026]
☆118Updated this week
pPetrichor / WorldCanvas
☆128Updated last month
PKU-YuanGroup / Edit-R1
Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback
☆228Updated 2 weeks ago