cfeng16 / GPS2Pix
[CVPR 2025] GPS as a Control Signal for Image Generation
☆23Updated 6 months ago
Alternatives and similar repositories for GPS2Pix
Users who are interested in GPS2Pix are comparing it to the repositories listed below.
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆49Updated 2 months ago
- [ICCV'25] Official implementation of "Reangle-A-Video: 4D Video Generation as Video-to-Video Translation"☆69Updated 3 months ago
- VChain: Chain-of-Visual-Thought for Reasoning in Video Generation☆42Updated this week
- Self-reimplemented version of 4D-LRM.☆58Updated 4 months ago
- [ICCV 2025] Official pytorch implementation of "SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering"☆49Updated 6 months ago
- Official implementation of EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance☆44Updated 4 months ago
- SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding☆54Updated 3 months ago
- Program synthesis for 3D spatial reasoning☆49Updated 3 months ago
- ☆27Updated last year
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization☆23Updated 5 months ago
- Official Repo for the Paper Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control☆34Updated 9 months ago
- VideoDirector [CVPR 2025]☆28Updated 6 months ago
- Official implementation of "URECA : Unique Region Caption Anything"☆53Updated 2 months ago
- Code release for 'Struct2D: A Perception-Guided Framework for Spatial Reasoning in Large Multimodal Models'☆20Updated 2 months ago
- [ICCV 2025] Official code for Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation☆44Updated 3 weeks ago
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation☆145Updated 2 months ago
- (ECCV 2024) Official implementation of Paper ''DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation''☆39Updated 11 months ago
- Official implementation of PARIS3D (Accepted to ECCV 2024).☆26Updated last year
- ☆33Updated 11 months ago
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)☆81Updated 7 months ago
- [EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation☆22Updated 3 months ago
- [AAAI 2025] GFlow: Recovering 4D World from Monocular Video☆53Updated 5 months ago
- Official implementation of DepthLM☆73Updated last week
- [ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image"☆22Updated last year
- [CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text☆52Updated 6 months ago
- Official Implementation of VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Jo…☆20Updated 3 months ago
- Official implementation of "Emergent Temporal Correspondences from Video Diffusion Models"☆81Updated 3 months ago
- [ICCV 2025] Official implementation of "What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?"☆14Updated 2 months ago
- [ECCV24] Official code for RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting☆31Updated last year
- ☆34Updated 4 months ago