[CVPR 2026🔥] Enhancing Spatial Understanding in Image Generation via Reward Modeling
☆84 Mar 2, 2026 Updated 2 months ago
Alternatives and similar repositories for SpatialT2I
Users that are interested in SpatialT2I are comparing it to the libraries listed below.
- ☆88 Mar 16, 2026 Updated 2 months ago
- [AAAI 2026] UltraGen ☆78 Feb 1, 2026 Updated 3 months ago
- Benchmark dataset and code of MSRVTT-Personalization ☆52 Nov 10, 2025 Updated 6 months ago
- [Arxiv 2026] ActionPlan: Future-Aware Streaming Motion Synthesis via Frame-Level Action Planning ☆85 Mar 26, 2026 Updated last month
- UniMesh: Unifying 3D Mesh Understanding and Generation ☆56 May 8, 2026 Updated 2 weeks ago
- Resilient multi-LLM orchestration with built-in failure handling, rate limits, retries, and circuit breaker. ☆45 Mar 23, 2026 Updated 2 months ago
- Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance ☆16 Nov 27, 2025 Updated 5 months ago
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation" ☆73 Feb 26, 2026 Updated 2 months ago
- Assessing Context-Aware Creative Intelligence in MLLMs ☆23 Jul 22, 2025 Updated 10 months ago
- [CVPR 2026] Adaptive Spectral Feature Forecasting for Diffusion Sampling Acceleration ☆119 Apr 30, 2026 Updated 3 weeks ago
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars" ☆43 Mar 24, 2026 Updated 2 months ago
- Track4World: Feedforward World-centric Dense 3D Tracking of All Pixels ☆224 Apr 27, 2026 Updated 3 weeks ago
- [CVPR 2026] Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video ☆116 Mar 24, 2026 Updated 2 months ago
- ☆89 May 13, 2026 Updated last week
- (Nature Communications Engineering 2024) Compressive Confocal Microscopy Imaging at the Single-Photon Level with Ultra-Low Sampling Ratio… ☆27 Mar 9, 2025 Updated last year
- [AAAI 2026 🔥] Official implementation of "NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representation" ☆182 Aug 14, 2025 Updated 9 months ago
- [ACM MM 24 Best Paper Nomination] ResVR: Joint Rescaling and Viewport Rendering of Omnidirectional Images ☆24 Dec 9, 2024 Updated last year
- [CVPR 2026] PSDesigner: Automated Graphic Design with a Human-Like Creative Workflow ☆142 Mar 28, 2026 Updated last month
- [NeurIPS 25] Official Implementation of TPP-SD: Accelerating Transformer Point Process Sampling with Speculative Decoding ☆49 Nov 17, 2025 Updated 6 months ago
- End2End Virtual Try-on with Visual Reference, CVPR2026 ☆64 Apr 18, 2026 Updated last month
- Memory-Augmented Deep Unfolding Network for Compressive Sensing (ACMMM 2021) ☆30 Nov 24, 2023 Updated 2 years ago
- ☆20 Sep 17, 2024 Updated last year
- [CVPR 2026] Scaling Zero-Shot Reference-to-Video Generation ☆72 Apr 28, 2026 Updated 3 weeks ago
- Official implementation of AnchorWeave: World-Consistent Video Generation with Retrieved Local Spatial Memories ☆91 Feb 17, 2026 Updated 3 months ago
- Imaging Tasks with Event Camera ☆34 Jan 10, 2025 Updated last year
- [ICML 2026🔥] Rethinking Video Generation Model for the Embodied World ☆66 May 4, 2026 Updated 3 weeks ago
- Accepted by TPAMI 2022 ☆37 Dec 6, 2022 Updated 3 years ago
- Gen-Searcher: Reinforcing Agentic Search for Image Generation ☆344 Apr 7, 2026 Updated last month
- ☆22 Mar 17, 2026 Updated 2 months ago
- ☆50 Aug 31, 2025 Updated 8 months ago
- BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models ☆43 Oct 30, 2025 Updated 6 months ago
- (TIP 2022) Content-Aware Scalable Deep Compressed Sensing [PyTorch] ☆45 Mar 9, 2025 Updated last year
- [ECCV2024] RS-NeRF: Neural Radiance Fields from Rolling Shutter Images ☆16 Jul 16, 2024 Updated last year
- Combined InstantID🔥 and FouriScale to generate high resolution image! ☆11 Apr 3, 2024 Updated 2 years ago
- 🌋LavaSR: Fast Speech restoration and enhancement ☆535 Apr 6, 2026 Updated last month
- GaussianDreamer extension of threestudio. ☆49 Apr 12, 2024 Updated 2 years ago
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners. ☆22 Sep 24, 2025 Updated 8 months ago
- [IJCV 2025] OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation ☆15 Feb 13, 2026 Updated 3 months ago
- Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (ICCV2025) ☆296 Dec 5, 2025 Updated 5 months ago