[CVPR 2026π₯] Enhancing Spatial Understanding in Image Generation via Reward Modeling
β82Mar 2, 2026Updated 2 months ago
Alternatives and similar repositories for SpatialT2I
Users that are interested in SpatialT2I are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- β86Mar 16, 2026Updated last month
- The official repository of EditCrafter: Tuning-free High-Resolution Image Editing via Pretrained Diffusion Model (CVPRW 2026)β42Apr 19, 2026Updated 2 weeks ago
- [AAAI 2026] UltraGenβ78Feb 1, 2026Updated 3 months ago
- Benchmark dataset and code of MSRVTT-Personalizationβ52Nov 10, 2025Updated 5 months ago
- UniMesh: Unifying 3D Mesh Understanding and Generationβ47Updated this week
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidanceβ15Nov 27, 2025Updated 5 months ago
- [CVPR 2026] Offical implementation of the paper "HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Preβ¦β83Mar 3, 2026Updated 2 months ago
- [CVPR 2026] Adaptive Spectral Feature Forecasting for Diffusion Sampling Accelerationβ116Updated this week
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"β72Feb 26, 2026Updated 2 months ago
- Assessing Context-Aware Creative Intelligence in MLLMsβ23Jul 22, 2025Updated 9 months ago
- https://little-misfit.github.io/GRAG-Image-Editing/β117Nov 27, 2025Updated 5 months ago
- Track4World: Feedforward World-centric Dense 3D Tracking of All Pixelsβ217Apr 27, 2026Updated last week
- [CVPR 2026] Spatio-Temporal Autoregressive 4K 360Β° Video Generation from Perspective Videoβ110Mar 24, 2026Updated last month
- (Nature Communications Engineering 2024) Compressive Confocal Microscopy Imaging at the Single-Photon Level with Ultra-Low Sampling Ratioβ¦β27Mar 9, 2025Updated last year
- Open source password manager - Proton Pass β’ AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [AAAI 2026 π₯] Official implementation of "NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representation"β181Aug 14, 2025Updated 8 months ago
- [ACM MM 24 Best Paper Nomination] ResVR: Joint Rescaling and Viewport Rendering of Omnidirectional Imagesβ24Dec 9, 2024Updated last year
- End2End Virtual Try-on with Visual Reference, CVPR2026β63Apr 18, 2026Updated 2 weeks ago
- Rethinking Video Generation Model for the Embodied Worldβ60Feb 12, 2026Updated 2 months ago
- β20Sep 17, 2024Updated last year
- [CVPR 2026] Scaling Zero-Shot Reference-to-Video Generationβ72Apr 28, 2026Updated last week
- Official implementation of AnchorWeave: World-Consistent Video Generation with Retrieved Local Spatial Memoriesβ91Feb 17, 2026Updated 2 months ago
- Imaging Tasks with Event Cameraβ34Jan 10, 2025Updated last year
- Accepted by TPAMI 2022β37Dec 6, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Gen-Searcher: Reinforcing Agentic Search for Image Generationβ316Apr 7, 2026Updated 3 weeks ago
- β48Aug 31, 2025Updated 8 months ago
- [ICLRβ26] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Controlβ105Feb 8, 2026Updated 2 months ago
- The official repo for the DanQing dataset.β35Mar 25, 2026Updated last month
- BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Modelsβ43Oct 30, 2025Updated 6 months ago
- Concat-ID: Towards Universal Identity-Preserving Video Synthesisβ66May 7, 2025Updated 11 months ago
- [ECCV2024] RS-NeRF: Neural Radiance Fields from Rolling Shutter Imagesβ16Jul 16, 2024Updated last year
- Combined InstantIDπ₯ and FouriScale to generate high resolution image!β11Apr 3, 2024Updated 2 years ago
- GaussianDreamer extension of threestudio.β49Apr 12, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.β22Sep 24, 2025Updated 7 months ago
- Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (ICCV2025)β292Dec 5, 2025Updated 5 months ago
- [IJCV 2025] OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generationβ15Feb 13, 2026Updated 2 months ago
- [ICLR 2026] NANO3D: A Training-Free Approach for Efficient 3D Editing Without Masksβ166Apr 2, 2026Updated last month
- [ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Controlβ180Dec 11, 2025Updated 4 months ago
- Why do deep convolutional networks generalize so poorly to small image transformations?β11Jun 23, 2019Updated 6 years ago
- βοΈ [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs".β56Feb 23, 2026Updated 2 months ago