cfeng16 / GPS2PixLinks
[CVPR 2025] GPS as a Control Signal for Image Generation
☆24Updated 8 months ago
Alternatives and similar repositories for GPS2Pix
Users that are interested in GPS2Pix are comparing it to the libraries listed below
Sorting:
- [ICCV'25] Official implementation of "Reangle-A-Video: 4D Video Generation as Video-to-Video Translation"☆77Updated 5 months ago
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆56Updated 2 weeks ago
- [ICCV 2025] Official pytorch implementation of "SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering"☆48Updated 8 months ago
- Self-reimplemented version of 4D-LRM.☆63Updated 6 months ago
- The official implementation of paper “VChain: Chain-of-Visual-Thought for Reasoning in Video Generation”☆106Updated 2 months ago
- ☆30Updated last year
- Official implementation of EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance☆45Updated 6 months ago
- [EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation☆27Updated 5 months ago
- Official Implementation of VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Jo…☆21Updated 5 months ago
- Official implementation of PARIS3D (Accepted to ECCV 2024).☆27Updated last year
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization☆24Updated 7 months ago
- SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding☆59Updated 5 months ago
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti…☆78Updated last year
- Official Repo for the Paper Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control☆36Updated 11 months ago
- (ECCV 2024) Official implementation of Paper ''DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation''☆39Updated last year
- Official repository of PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning☆53Updated last month
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)☆85Updated 9 months ago
- ☆18Updated last month
- Official code for "Amodal Completion via Progressive Mixed Context Diffusion" [CVPR 2024 Highlight]☆52Updated last year
- VideoDirector [CVPR 2025]☆33Updated 2 weeks ago
- Official implementation of "Emergent Temporal Correspondences from Video Diffusion Models"☆88Updated this week
- [ECCV24] Official code for RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting☆31Updated last year
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation☆46Updated 3 months ago
- Official implementation of "URECA : Unique Region Caption Anything"☆55Updated 4 months ago
- Official code for VIST3A: Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator☆74Updated last month
- [CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text☆53Updated 8 months ago
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation☆170Updated this week
- [ICCV 2025] Official code for Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation☆49Updated 2 months ago
- [ECCV2024] Vista3D: Unravel the 3D Darkside of a Single Image☆55Updated last year
- [CVPR 2024 Highlight] ViVid-1-to-3: Novel View Synthesis with Video Diffusion Models☆173Updated last year