Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".
☆137Dec 18, 2025Updated 2 months ago
Alternatives and similar repositories for SVG-T2I
Users that are interested in SVG-T2I are comparing it to the libraries listed below
Sorting:
- Official PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT☆166Oct 21, 2025Updated 4 months ago
- DreamStyle: A Unified Framework for Video Stylization☆109Jan 7, 2026Updated 2 months ago
- [CVPR 2026] NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos☆393Feb 26, 2026Updated last week
- ☆56Nov 12, 2025Updated 3 months ago
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆25Feb 10, 2026Updated 3 weeks ago
- ☆41Oct 29, 2025Updated 4 months ago
- SpotEdit:Selective Region Editing in Diffusion Transformers☆174Jan 5, 2026Updated 2 months ago
- [ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Control☆167Dec 11, 2025Updated 2 months ago
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation☆46Aug 26, 2025Updated 6 months ago
- Code for paper "CLiFT: Compressive Light-Field Tokens for Compute Efficient and Adaptive Neural Rendering" [NeurIPS 2025 (spotlight)]☆75Aug 2, 2025Updated 7 months ago
- ☆190Dec 10, 2025Updated 2 months ago
- ☆130Dec 19, 2025Updated 2 months ago
- ORES: Open-vocabulary Responsible Visual Synthesis☆14Dec 12, 2023Updated 2 years ago
- Overworld's local world client interface to run Waypoint world models☆46Mar 1, 2026Updated last week
- Rethinking the Trust Region in LLM Reinforcement Learning☆39Feb 25, 2026Updated last week
- ☆15Sep 22, 2025Updated 5 months ago
- [ICCV 2025 ⭐highlight⭐] Implementation of VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory☆417Jul 25, 2025Updated 7 months ago
- [3DV 2026 Oral] VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space☆212Nov 25, 2025Updated 3 months ago
- DiT for VAE (and Video Generation)☆35Sep 2, 2024Updated last year
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer☆649Oct 16, 2024Updated last year
- [NeurIPS 2025] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT☆430Sep 18, 2025Updated 5 months ago
- ThinkGen: Generalized Thinking for Visual Generation☆51Dec 30, 2025Updated 2 months ago
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"☆69Feb 26, 2026Updated last week
- [SIGGRAPH-ASIA 2025] Official implementation of "VideoFrom3D: 3D Scene Video Generation via Complementary Image and Video Diffusion Model…☆126Oct 27, 2025Updated 4 months ago
- This is the official repository of UltraHR-100K.☆44Nov 21, 2025Updated 3 months ago
- Code2Worlds: Empowering Coding LLMs for 4D World Generation☆87Feb 26, 2026Updated last week
- ☆65Jul 10, 2025Updated 7 months ago
- Inference server for MioTTS, a lightweight and fast LLM-based TTS model.☆103Feb 14, 2026Updated 3 weeks ago
- ☆17Jun 14, 2024Updated last year
- ☆37Jun 4, 2025Updated 9 months ago
- GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset☆245Aug 15, 2025Updated 6 months ago
- Animate Any Character in Any World☆90Jan 9, 2026Updated last month
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)☆534Sep 8, 2025Updated 6 months ago
- [ICLR 2026] Code for Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model☆32Mar 1, 2026Updated last week
- [CVPR2024] The official implementation of paper Relation Rectification in Diffusion Model☆48Sep 13, 2024Updated last year
- ☆20Jan 1, 2026Updated 2 months ago
- Official implementation for "Nested Attention: Semantic-aware Attention Values for Concept Personalization" [SIGGRAPH 2025]☆27Aug 4, 2025Updated 7 months ago
- ☆19Apr 16, 2025Updated 10 months ago
- LAizypainter is a Photoshop plugin with which you can send tasks directly to a Stable Diffusion server.☆21Jul 25, 2024Updated last year