[CVPR 2026 🔥] Enhancing Spatial Understanding in Image Generation via Reward Modeling
☆85 · Mar 2, 2026 · Updated 4 months ago
Alternatives and similar repositories for SpatialT2I
Users who are interested in SpatialT2I are comparing it to the repositories listed below.
- The official repository of EditCrafter: Tuning-free High-Resolution Image Editing via Pretrained Diffusion Model (CVPRW 2026) ☆50 · Apr 19, 2026 · Updated 2 months ago
- [AAAI 2026] UltraGen ☆78 · Feb 1, 2026 · Updated 5 months ago
- Benchmark dataset and code of MSRVTT-Personalization ☆52 · Nov 10, 2025 · Updated 7 months ago
- UniMesh: Unifying 3D Mesh Understanding and Generation ☆57 · May 8, 2026 · Updated last month
- [Arxiv 2026] ActionPlan: Future-Aware Streaming Motion Synthesis via Frame-Level Action Planning ☆90 · Mar 26, 2026 · Updated 3 months ago
- Resilient multi-LLM orchestration with built-in failure handling, rate limits, retries, and circuit breaker. ☆48 · Jun 1, 2026 · Updated last month
- Assessing Context-Aware Creative Intelligence in MLLMs ☆23 · Jul 22, 2025 · Updated 11 months ago
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation" ☆75 · Feb 26, 2026 · Updated 4 months ago
- [CVPR 2026] Official implementation of the paper "HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Pre…" ☆101 · Jun 7, 2026 · Updated 3 weeks ago
- [CVPR 2026] Adaptive Spectral Feature Forecasting for Diffusion Sampling Acceleration ☆125 · Apr 30, 2026 · Updated 2 months ago
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars" ☆43 · Mar 24, 2026 · Updated 3 months ago
- [ECCV 2026] Track4World: Feedforward World-centric Dense 3D Tracking of All Pixels ☆248 · Updated this week
- (Nature Communications Engineering 2024) Compressive Confocal Microscopy Imaging at the Single-Photon Level with Ultra-Low Sampling Ratio… ☆27 · Mar 9, 2025 · Updated last year
- ☆90 · May 13, 2026 · Updated last month
- [AAAI 2026 🔥] Official implementation of "NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representation" ☆183 · Aug 14, 2025 · Updated 10 months ago
- [CVPR 2026] Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video ☆124 · Mar 24, 2026 · Updated 3 months ago
- [CVPR 2026] PSDesigner: Automated Graphic Design with a Human-Like Creative Workflow ☆145 · Mar 28, 2026 · Updated 3 months ago
- [NeurIPS 25] Official Implementation of TPP-SD: Accelerating Transformer Point Process Sampling with Speculative Decoding ☆49 · Nov 17, 2025 · Updated 7 months ago
- End2End Virtual Try-on with Visual Reference, CVPR2026 ☆68 · Apr 18, 2026 · Updated 2 months ago
- Memory-Augmented Deep Unfolding Network for Compressive Sensing (ACMMM 2021) ☆30 · Nov 24, 2023 · Updated 2 years ago
- ☆20 · Sep 17, 2024 · Updated last year
- [CVPR 2026] Scaling Zero-Shot Reference-to-Video Generation ☆76 · Apr 28, 2026 · Updated 2 months ago
- Official implementation of AnchorWeave: World-Consistent Video Generation with Retrieved Local Spatial Memories ☆94 · Feb 17, 2026 · Updated 4 months ago
- [ICML 2026 🔥] Rethinking Video Generation Model for the Embodied World ☆74 · Jun 1, 2026 · Updated last month
- Gen-Searcher: Reinforcing Agentic Search for Image Generation ☆371 · Apr 7, 2026 · Updated 2 months ago
- ☆28 · Mar 17, 2026 · Updated 3 months ago
- [ICLR'26] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control ☆108 · Feb 8, 2026 · Updated 4 months ago
- BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models ☆47 · Oct 30, 2025 · Updated 8 months ago
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis ☆65 · May 7, 2025 · Updated last year
- (TIP 2022) Content-Aware Scalable Deep Compressed Sensing [PyTorch] ☆47 · Mar 9, 2025 · Updated last year
- [ECCV2024] RS-NeRF: Neural Radiance Fields from Rolling Shutter Images ☆16 · Jul 16, 2024 · Updated last year
- Combined InstantID 🔥 and FouriScale to generate high-resolution images! ☆11 · Apr 3, 2024 · Updated 2 years ago
- GaussianDreamer extension of threestudio. ☆49 · Apr 12, 2024 · Updated 2 years ago
- LavaSR: Fast Speech restoration and enhancement ☆560 · Jun 19, 2026 · Updated 2 weeks ago
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners. ☆22 · Sep 24, 2025 · Updated 9 months ago
- [IJCV 2025] OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation ☆16 · Feb 13, 2026 · Updated 4 months ago
- [ICLR 2026] NANO3D: A Training-Free Approach for Efficient 3D Editing Without Masks ☆177 · Apr 2, 2026 · Updated 3 months ago
- [ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Control ☆189 · Dec 11, 2025 · Updated 6 months ago
- TriSplat: Simulation-Ready Feed-Forward 3D Scene Reconstruction ☆325 · Jun 12, 2026 · Updated 3 weeks ago