[CVPR 2026 🔥] Enhancing Spatial Understanding in Image Generation via Reward Modeling
★ 80 · Mar 2, 2026 · Updated last month
Alternatives and similar repositories for SpatialT2I
Users that are interested in SpatialT2I are comparing it to the libraries listed below.
- ★ 84 · Mar 16, 2026 · Updated last month
- [AAAI 2026] UltraGen · ★ 78 · Feb 1, 2026 · Updated 2 months ago
- Benchmark dataset and code of MSRVTT-Personalization · ★ 52 · Nov 10, 2025 · Updated 5 months ago
- Resilient multi-LLM orchestration with built-in failure handling, rate limits, retries, and a circuit breaker. · ★ 31 · Mar 23, 2026 · Updated 3 weeks ago
- Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance · ★ 15 · Nov 27, 2025 · Updated 4 months ago
- [CVPR 2026] Adaptive Spectral Feature Forecasting for Diffusion Sampling Acceleration · ★ 114 · Mar 15, 2026 · Updated last month
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation" · ★ 70 · Feb 26, 2026 · Updated last month
- Assessing Context-Aware Creative Intelligence in MLLMs · ★ 23 · Jul 22, 2025 · Updated 8 months ago
- https://little-misfit.github.io/GRAG-Image-Editing/ · ★ 117 · Nov 27, 2025 · Updated 4 months ago
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars" · ★ 41 · Mar 24, 2026 · Updated 3 weeks ago
- [CVPR 2026] PSDesigner: Automated Graphic Design with a Human-Like Creative Workflow · ★ 122 · Mar 28, 2026 · Updated 2 weeks ago
- (Nature Communications Engineering 2024) Compressive Confocal Microscopy Imaging at the Single-Photon Level with Ultra-Low Sampling Ratio… · ★ 26 · Mar 9, 2025 · Updated last year
- [CVPR 2026] Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video · ★ 107 · Mar 24, 2026 · Updated 3 weeks ago
- ★ 88 · Feb 4, 2026 · Updated 2 months ago
- [AAAI 2026 🔥] Official implementation of "NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representation" · ★ 182 · Aug 14, 2025 · Updated 8 months ago
- End2End Virtual Try-on with Visual Reference, CVPR2026 · ★ 61 · Mar 29, 2026 · Updated 2 weeks ago
- Memory-Augmented Deep Unfolding Network for Compressive Sensing (ACMMM 2021) · ★ 29 · Nov 24, 2023 · Updated 2 years ago
- Gen-Searcher: Reinforcing Agentic Search for Image Generation · ★ 257 · Apr 7, 2026 · Updated last week
- ★ 20 · Sep 17, 2024 · Updated last year
- [CVPR 2026] Scaling Zero-Shot Reference-to-Video Generation · ★ 68 · Dec 11, 2025 · Updated 4 months ago
- Official implementation of AnchorWeave: World-Consistent Video Generation with Retrieved Local Spatial Memories · ★ 90 · Feb 17, 2026 · Updated last month
- Imaging Tasks with Event Camera · ★ 33 · Jan 10, 2025 · Updated last year
- Accepted by TPAMI 2022 · ★ 37 · Dec 6, 2022 · Updated 3 years ago
- ★ 45 · Aug 31, 2025 · Updated 7 months ago
- [ICLR’26] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control · ★ 106 · Feb 8, 2026 · Updated 2 months ago
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners. · ★ 21 · Sep 24, 2025 · Updated 6 months ago
- The official repo for the DanQing dataset. · ★ 34 · Mar 25, 2026 · Updated 3 weeks ago
- Official code for our CVPR 2021 paper: "When Human Pose Estimation Meets Robustness: Adversarial Algorithms and Benchmarks". · ★ 33 · Jul 6, 2021 · Updated 4 years ago
- BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models · ★ 43 · Oct 30, 2025 · Updated 5 months ago
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis · ★ 65 · May 7, 2025 · Updated 11 months ago
- (TIP 2022) Content-Aware Scalable Deep Compressed Sensing [PyTorch] · ★ 45 · Mar 9, 2025 · Updated last year
- [ECCV2024] RS-NeRF: Neural Radiance Fields from Rolling Shutter Images · ★ 16 · Jul 16, 2024 · Updated last year
- LavaSR: Fast Speech restoration and enhancement · ★ 508 · Apr 6, 2026 · Updated last week
- GaussianDreamer extension of threestudio. · ★ 49 · Apr 12, 2024 · Updated 2 years ago
- Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (ICCV2025) · ★ 285 · Dec 5, 2025 · Updated 4 months ago
- [IJCV 2025] OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation · ★ 15 · Feb 13, 2026 · Updated 2 months ago
- [ICLR 2026] NANO3D: A Training-Free Approach for Efficient 3D Editing Without Masks · ★ 159 · Apr 2, 2026 · Updated 2 weeks ago
- [ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Control · ★ 175 · Dec 11, 2025 · Updated 4 months ago
- Track4World: Feedforward World-centric Dense 3D Tracking of All Pixels · ★ 203 · Mar 11, 2026 · Updated last month