cfeng16 / GPS2PixLinks
[CVPR 2025] GPS as a Control Signal for Image Generation
☆24Updated 8 months ago
Alternatives and similar repositories for GPS2Pix
Users that are interested in GPS2Pix are comparing it to the libraries listed below
Sorting:
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆52Updated 3 months ago
- [ICCV'25] Official implementation of "Reangle-A-Video: 4D Video Generation as Video-to-Video Translation"☆75Updated 4 months ago
- [ICCV 2025] Official pytorch implementation of "SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering"☆48Updated 7 months ago
- The official implementation of paper “VChain: Chain-of-Visual-Thought for Reasoning in Video Generation”☆101Updated last month
- Self-reimplemented version of 4D-LRM.☆62Updated 5 months ago
- ☆30Updated last year
- Official implementation of EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance☆45Updated 5 months ago
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)☆83Updated 8 months ago
- Official implementation of "Emergent Temporal Correspondences from Video Diffusion Models"☆85Updated 4 months ago
- (ECCV 2024) Official implementation of Paper ''DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation''☆39Updated last year
- Official Implementation of VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Jo…☆21Updated 4 months ago
- Official implementation of PARIS3D (Accepted to ECCV 2024).☆27Updated last year
- Official implementation of "URECA : Unique Region Caption Anything"☆54Updated 4 months ago
- Official Repo for the Paper Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control☆35Updated 10 months ago
- ☆34Updated last year
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti…☆77Updated last year
- Official code for VIST3A: Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator☆62Updated last month
- [ECCV2024] Vista3D: Unravel the 3D Darkside of a Single Image☆55Updated last year
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization☆24Updated 7 months ago
- [ECCV24] Official code for RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting☆31Updated last year
- VideoDirector [CVPR 2025]☆32Updated 7 months ago
- Official repository of PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning☆52Updated last month
- ☆40Updated last year
- [CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text☆52Updated 8 months ago
- [EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation☆24Updated 5 months ago
- [AAAI 2025] GFlow: Recovering 4D World from Monocular Video☆55Updated 6 months ago
- Official code for "Amodal Completion via Progressive Mixed Context Diffusion" [CVPR 2024 Highlight]☆51Updated last year
- Official Code for 'AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction' (ICCV 2025)☆59Updated last week
- Visual Spatial Tuning☆133Updated this week
- ☆26Updated 7 months ago