[CVPR 2026🔥] Enhancing Spatial Understanding in Image Generation via Reward Modeling
☆51Mar 2, 2026Updated this week
Alternatives and similar repositories for SpatialT2I
Users that are interested in SpatialT2I are comparing it to the libraries listed below
Sorting:
- Assessing Context-Aware Creative Intelligence in MLLMs☆23Jul 22, 2025Updated 7 months ago
- BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models☆40Oct 30, 2025Updated 4 months ago
- ☆43Aug 31, 2025Updated 6 months ago
- a unified reinforcement learning toolbox for joint RL on language models and diffusion models☆75Feb 7, 2026Updated 3 weeks ago
- Official repository for TikTok-DeepFake (TT-DF)☆13Feb 17, 2025Updated last year
- Benchmark dataset and code of MSRVTT-Personalization☆52Nov 10, 2025Updated 3 months ago
- Tools for using the Kinect One (Kinect v2) in ROS☆10Nov 19, 2020Updated 5 years ago
- Hand Mesh Recovery models on OakInk-Image dataset☆12Apr 4, 2024Updated last year
- A Prompt Learning Framework for Source Code Summarization☆14Dec 26, 2023Updated 2 years ago
- https://github.com/OSVR/distortionizer --- This is a modification of OSVR distortionizer used to modify SteamVR HMD config files to fine …☆12Oct 3, 2020Updated 5 years ago
- ☆10Dec 29, 2021Updated 4 years ago
- ☆11Jul 3, 2019Updated 6 years ago
- Official implementation for GATSBI: Generative Agent-centric Spatio-temporal Object Interaction (CVPR'2021)☆12Mar 23, 2022Updated 3 years ago
- This repo contains the code to reproduce our results in CVPR21 Challenge on Agriculture-Vision.☆10Jan 3, 2022Updated 4 years ago
- This repository open-sources CreatiPoster, an AI-driven graphic design generation system for multi-layer and editable compositions with s…☆81Jun 14, 2025Updated 8 months ago
- GDPnet: "Geometry-guided Dense Perspective Network for Speech-Driven Facial Animation." (TVCG 2021)☆11Nov 21, 2021Updated 4 years ago
- Implementation of FaceBaker: Baking Character Facial Rigs with Machine Learning☆11Jun 28, 2020Updated 5 years ago
- The official code of "Beyond VLM-Based Rewards: Diffusion-Native Latent Reward Modeling"☆47Feb 26, 2026Updated last week
- 100 game demos by Crossin的编程教室☆15Jun 4, 2025Updated 9 months ago
- Official implementation for our paper: Rethinking Video Tokenization: A Conditioned Diffusion-based Approach☆14Apr 2, 2025Updated 11 months ago
- ☆11Feb 26, 2024Updated 2 years ago
- ☆13Jan 22, 2025Updated last year
- [NeurIPS 2025] The official implementation of "MOTION: Multi-Sculpt Evolutionary Coarsening for Federated Continual Graph Learning"☆39Nov 18, 2025Updated 3 months ago
- Fall 2023 NJUSE Machine Learning Course -- Group Project: DeepEMD for LibFewShot☆10May 16, 2024Updated last year
- ☆13Mar 9, 2024Updated last year
- 《算法设计与分析(第2版)》黄宇编著 个人题解(部分)☆14Oct 28, 2023Updated 2 years ago
- ☆10Feb 6, 2022Updated 4 years ago
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis☆66May 7, 2025Updated 9 months ago
- ☆18Jul 26, 2024Updated last year
- The official repo for the DanQing dataset.☆30Jan 16, 2026Updated last month
- The official code for our paper StackFLOW: Monocular Human-Object Reconstruction by Stacked Normalizing Flow with Offset in IJCAI 2023.☆13Jul 17, 2024Updated last year
- [NeurIPS 2023] Official PyTorch implementation for the paper "CRoSS: Diffusion Model Makes Controllable, Robust and Secure Image Steganog…☆11Sep 28, 2023Updated 2 years ago
- GitHub Markdown Admonition Syntax Plugin for Markdown It☆13Nov 29, 2023Updated 2 years ago
- Code repository for Body Knowledge and Uncertainty Modeling for Monocular 3D Human Body Reconstruction, ICCV2023☆14Dec 18, 2025Updated 2 months ago
- Iterative Closest Point (ICP) algorithm implemented with Python.☆10Dec 25, 2017Updated 8 years ago
- ☆15Feb 18, 2023Updated 3 years ago
- ☆52Dec 13, 2024Updated last year
- ☆15Jul 24, 2024Updated last year
- ☆12Nov 7, 2020Updated 5 years ago