KlingAIResearch / SVG-T2I
Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".
☆132 · Dec 18, 2025 · Updated last month
Alternatives and similar repositories for SVG-T2I
Users who are interested in SVG-T2I are comparing it to the repositories listed below.
- Official PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT ☆164 · Oct 21, 2025 · Updated 3 months ago
- FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection ☆24 · Updated this week
- DreamStyle: A Unified Framework for Video Stylization ☆110 · Jan 7, 2026 · Updated last month
- ☆53 · Nov 12, 2025 · Updated 3 months ago
- ☆34 · Oct 29, 2025 · Updated 3 months ago
- SpotEdit: Selective Region Editing in Diffusion Transformers ☆171 · Jan 5, 2026 · Updated last month
- [ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Control ☆166 · Dec 11, 2025 · Updated 2 months ago
- Rethinking the Trust Region in LLM Reinforcement Learning ☆34 · Feb 5, 2026 · Updated last week
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation ☆46 · Aug 26, 2025 · Updated 5 months ago
- Code for paper "CLiFT: Compressive Light-Field Tokens for Compute Efficient and Adaptive Neural Rendering" [NeurIPS 2025 (spotlight)] ☆75 · Aug 2, 2025 · Updated 6 months ago
- ☆185 · Dec 10, 2025 · Updated 2 months ago
- ☆130 · Dec 19, 2025 · Updated last month
- ☆15 · Sep 22, 2025 · Updated 4 months ago
- ORES: Open-vocabulary Responsible Visual Synthesis ☆14 · Dec 12, 2023 · Updated 2 years ago
- NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos ☆342 · Jan 5, 2026 · Updated last month
- Overworld's local world client interface to run Waypoint world models ☆44 · Updated this week
- [ICCV 2025 ⭐highlight⭐] Implementation of VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory ☆414 · Jul 25, 2025 · Updated 6 months ago
- [3DV 2026 Oral] VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space ☆205 · Nov 25, 2025 · Updated 2 months ago
- DiT for VAE (and Video Generation) ☆35 · Sep 2, 2024 · Updated last year
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer ☆648 · Oct 16, 2024 · Updated last year
- [NeurIPS 2025] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT ☆430 · Sep 18, 2025 · Updated 4 months ago
- [SIGGRAPH-ASIA 2025] Official implementation of "VideoFrom3D: 3D Scene Video Generation via Complementary Image and Video Diffusion Model… ☆125 · Oct 27, 2025 · Updated 3 months ago
- ThinkGen: Generalized Thinking for Visual Generation ☆51 · Dec 30, 2025 · Updated last month
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation" ☆70 · Jan 10, 2026 · Updated last month
- ☆35 · Jun 4, 2025 · Updated 8 months ago
- ArcFlow: Unleashing 2-Step Text-to-Image Generation via High-Precision Non-Linear Flow Distillation ☆50 · Updated this week
- ☆63 · Jul 10, 2025 · Updated 7 months ago
- ☆17 · Jun 14, 2024 · Updated last year
- Animate Any Character in Any World ☆88 · Jan 9, 2026 · Updated last month
- GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset ☆244 · Aug 15, 2025 · Updated 6 months ago
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024) ☆534 · Sep 8, 2025 · Updated 5 months ago
- LAizypainter is a Photoshop plugin with which you can send tasks directly to a Stable Diffusion server. ☆21 · Jul 25, 2024 · Updated last year
- Code for Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model ☆31 · Jun 19, 2025 · Updated 7 months ago
- ☆20 · Jan 1, 2026 · Updated last month
- Java SDK for Z.ai Open Platform ☆43 · Feb 2, 2026 · Updated 2 weeks ago
- [ICLR 26] Part-X-MLLM: Part-aware 3D Multimodal Large Language Model ☆110 · Jan 26, 2026 · Updated 3 weeks ago
- Official implementation for "Nested Attention: Semantic-aware Attention Values for Concept Personalization" [SIGGRAPH 2025] ☆27 · Aug 4, 2025 · Updated 6 months ago
- ☆19 · Apr 16, 2025 · Updated 10 months ago
- [CVPR2024] The official implementation of paper Relation Rectification in Diffusion Model ☆48 · Sep 13, 2024 · Updated last year