[CVPR 2026🔥] Enhancing Spatial Understanding in Image Generation via Reward Modeling
☆78Mar 2, 2026Updated 3 weeks ago
Alternatives and similar repositories for SpatialT2I
Users that are interested in SpatialT2I are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [AAAI 2026] UltraGen☆77Feb 1, 2026Updated last month
- Benchmark dataset and code of MSRVTT-Personalization☆51Nov 10, 2025Updated 4 months ago
- [CVPR 2026] Adaptive Spectral Feature Forecasting for Diffusion Sampling Acceleration☆101Mar 15, 2026Updated last week
- [CVPR 2026] Offical implementation of the paper "HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Pre…☆63Mar 3, 2026Updated 3 weeks ago
- Assessing Context-Aware Creative Intelligence in MLLMs☆23Jul 22, 2025Updated 8 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [CVPR 2026] Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video☆95Mar 5, 2026Updated 3 weeks ago
- https://little-misfit.github.io/GRAG-Image-Editing/☆117Nov 27, 2025Updated 3 months ago
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars"☆41Nov 20, 2025Updated 4 months ago
- (Nature Communications Engineering 2024) Compressive Confocal Microscopy Imaging at the Single-Photon Level with Ultra-Low Sampling Ratio…☆26Mar 9, 2025Updated last year
- ☆86Feb 4, 2026Updated last month
- Rethinking Video Generation Model for the Embodied World☆54Feb 12, 2026Updated last month
- [ACM MM 24 Best Paper Nomination] ResVR: Joint Rescaling and Viewport Rendering of Omnidirectional Images☆24Dec 9, 2024Updated last year
- End2End Virtual Try-on with Visual Reference, CVPR2026☆58Mar 17, 2026Updated last week
- Scaling Zero-Shot Reference-to-Video Generation☆64Dec 11, 2025Updated 3 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Memory-Augmented Deep Unfolding Network for Compressive Sensing(ACMMM 2021)☆29Nov 24, 2023Updated 2 years ago
- Official implementation of AnchorWeave: World-Consistent Video Generation with Retrieved Local Spatial Memories☆88Feb 17, 2026Updated last month
- Accepted by TPAMI 2022☆37Dec 6, 2022Updated 3 years ago
- [ICLR’26] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control☆104Feb 8, 2026Updated last month
- 🌋LavaSR: Fast Speech restoration and enhancement☆482Mar 10, 2026Updated 2 weeks ago
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.☆20Sep 24, 2025Updated 6 months ago
- The official repo for the DanQing dataset.☆32Jan 16, 2026Updated 2 months ago
- Official code for our CVPR 2021 paper: "When Human Pose Estimation Meets Robustness: Adversarial Algorithms and Benchmarks".☆33Jul 6, 2021Updated 4 years ago
- BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models☆40Oct 30, 2025Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis☆66May 7, 2025Updated 10 months ago
- Track4World: Feedforward World-centric Dense 3D Tracking of All Pixels☆174Mar 11, 2026Updated 2 weeks ago
- Combined InstantID🔥 and FouriScale to generate high resolution image!☆11Apr 3, 2024Updated last year
- [ECCV2024] RS-NeRF: Neural Radiance Fields from Rolling Shutter Images☆16Jul 16, 2024Updated last year
- [ICLR 2026] NANO3D: A Training-Free Approach for Efficient 3D Editing Without Masks☆141Oct 20, 2025Updated 5 months ago
- GaussianDreamer extension of threestudio.☆50Apr 12, 2024Updated last year
- Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (ICCV2025)☆278Dec 5, 2025Updated 3 months ago
- [ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Control☆173Dec 11, 2025Updated 3 months ago
- [IJCV 2025] OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation☆15Feb 13, 2026Updated last month
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Generate high resolution videos with a custom voice and appearance, based on LTX-2/LTX-2.3 + Identity In-Context LoRA☆158Updated this week
- [TCSVT 2024] D3C2-Net: Dual-Domain Deep Convolutional Coding Network for Compressive Sensing☆43Dec 9, 2024Updated last year
- ⚔️ [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs".☆53Feb 23, 2026Updated last month
- [ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization".☆173Feb 4, 2026Updated last month
- Why do deep convolutional networks generalize so poorly to small image transformations?☆11Jun 23, 2019Updated 6 years ago
- The official implementation of ”RepVideo: Rethinking Cross-Layer Representation for Video Generation“☆124Jan 25, 2025Updated last year
- GDPnet: "Geometry-guided Dense Perspective Network for Speech-Driven Facial Animation." (TVCG 2021)☆11Nov 21, 2021Updated 4 years ago