Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".
☆142 · Dec 18, 2025 · Updated 4 months ago
Alternatives and similar repositories for SVG-T2I
Users that are interested in SVG-T2I are comparing it to the libraries listed below.
- ☆14 · Sep 22, 2025 · Updated 6 months ago
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation ☆46 · Aug 26, 2025 · Updated 7 months ago
- Code of Strips as Tokens: Artist Mesh Generation with Native UV Segmentation. ACM Transactions on Graphics (SIGGRAPH 2026) ☆98 · Updated this week
- [ICLR 2026] Code for Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model ☆31 · Mar 1, 2026 · Updated last month
- Rethinking the Trust Region in LLM Reinforcement Learning ☆51 · Mar 2, 2026 · Updated last month
- [ICCV 2025 ⭐highlight⭐] Implementation of VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory ☆422 · Jul 25, 2025 · Updated 8 months ago
- Code2Worlds: Empowering Coding LLMs for 4D World Generation ☆96 · Feb 26, 2026 · Updated last month
- Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models ☆233 · Apr 10, 2026 · Updated last week
- GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset ☆246 · Aug 15, 2025 · Updated 8 months ago
- [NeurIPS 2025] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT ☆431 · Sep 18, 2025 · Updated 7 months ago
- ORES: Open-vocabulary Responsible Visual Synthesis ☆14 · Dec 12, 2023 · Updated 2 years ago
- Code for paper "CLiFT: Compressive Light-Field Tokens for Compute Efficient and Adaptive Neural Rendering" [NeurIPS 2025 (spotlight)] ☆75 · Aug 2, 2025 · Updated 8 months ago
- ☆67 · Jul 10, 2025 · Updated 9 months ago
- DiT for VAE (and Video Generation) ☆35 · Sep 2, 2024 · Updated last year
- ☆60 · Nov 12, 2025 · Updated 5 months ago
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer ☆652 · Oct 16, 2024 · Updated last year
- [CVPR 2026] Scaling Zero-Shot Reference-to-Video Generation ☆69 · Dec 11, 2025 · Updated 4 months ago
- Official repository of paper "ProEdit: Inversion-based Editing From Prompts Done Right" ☆116 · Feb 5, 2026 · Updated 2 months ago
- ☆33 · Apr 22, 2025 · Updated 11 months ago
- [CVPR 2026 Highlight] NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos ☆502 · Apr 9, 2026 · Updated last week
- ThinkGen: Generalized Thinking for Visual Generation ☆52 · Dec 30, 2025 · Updated 3 months ago
- DreamStyle: A Unified Framework for Video Stylization ☆117 · Jan 7, 2026 · Updated 3 months ago
- https://little-misfit.github.io/GRAG-Image-Editing/ ☆117 · Nov 27, 2025 · Updated 4 months ago
- ☆25 · Mar 30, 2025 · Updated last year
- A simple aesthetic scorer + pruner + website you can run to view the results from the scoring with ☆16 · Jun 3, 2024 · Updated last year
- Animate Any Character in Any World ☆97 · Mar 10, 2026 · Updated last month
- ☆20 · Jan 1, 2026 · Updated 3 months ago
- [ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Control ☆175 · Dec 11, 2025 · Updated 4 months ago
- [ECCV 2024 Oral] The official implementation of paper: COHO: Context-Sensitive City-Scale Hierarchical Urban Layout Generation ☆11 · Aug 13, 2024 · Updated last year
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection ☆31 · Feb 10, 2026 · Updated 2 months ago
- ☆136 · Dec 19, 2025 · Updated 4 months ago
- [3DV 2026 Oral] VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space ☆222 · Nov 25, 2025 · Updated 4 months ago
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation" ☆70 · Feb 26, 2026 · Updated last month
- ☆11 · Jun 5, 2023 · Updated 2 years ago
- ☆19 · Apr 16, 2025 · Updated last year
- A Unified Visual Generator with Interleaved OmniModal Context ☆211 · Mar 5, 2026 · Updated last month
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers ☆66 · Oct 16, 2024 · Updated last year