cfeng16 / GPS2Pix
[CVPR 2025] GPS as a Control Signal for Image Generation
☆18Updated last month
Alternatives and similar repositories for GPS2Pix:
Users that are interested in GPS2Pix are comparing it to the libraries listed below
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆31Updated this week
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization☆18Updated 3 weeks ago
- A list of works on video generation towards world model☆58Updated this week
- Official implementation of "Reangle-A-Video: 4D Video Generation as Video-to-Video Translation"☆39Updated last month
- ☆35Updated last month
- [ECCV24] Official code for RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting☆30Updated 8 months ago
- [ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image"☆22Updated last year
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation☆96Updated 3 weeks ago
- Official pytorch implementation of "SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering"☆30Updated last month
- Open-world 3D part segmentation of point clouds☆75Updated last month
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti…☆78Updated 8 months ago
- FleVRS: Towards Flexible Visual Relationship Segmentation, NeurIPS 2024☆20Updated 5 months ago
- ☆22Updated last month
- ☆25Updated last year
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)☆67Updated 2 months ago
- Official implementation of PARIS3D (Accepted to ECCV 2024).☆24Updated 7 months ago
- Official code for "Amodal Completion via Progressive Mixed Context Diffusion" [CVPR 2024 Highlight]☆44Updated 9 months ago
- GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography☆50Updated last month
- [ECCV 2024] EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion.☆93Updated 11 months ago
- open-sourced video dataset with dynamic scenes and camera movements annotation☆51Updated 2 weeks ago
- [CVPR'24] GraphDreamer: a novel framework of generating compositional 3D scenes from scene graphs.☆179Updated last year
- ☆21Updated 6 months ago
- [3DV 2025] Learning Naturally Aggregated Appearance for Efficient 3D Editing☆34Updated 2 months ago
- VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation☆21Updated last month
- [ECCV 2024] HiFi-123: Towards High-fidelity One Image to 3D Content Generation☆66Updated 9 months ago
- [CVPR2025] Official PyTorch implementation of "Optical-Flow Guided Prompt Optimization for Coherent Video Generation (Motion Prompt)"☆20Updated 2 months ago
- This is the project page of ShowRoom3D☆25Updated last year
- [ICLR 2025] Layout-Your-3D: Controllable and Precise 3D Generation with 2D Blueprint☆11Updated 2 months ago
- Spatial-R1: The first MLLM trained using GRPO for spatial reasoning in videos☆31Updated last week
- [CVPR 2025] 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs☆38Updated 10 months ago