SOTAMak1r / GSTLinks
[ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction
☆42Updated 4 months ago
Alternatives and similar repositories for GST
Users that are interested in GST are comparing it to the libraries listed below
Sorting:
- UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation☆134Updated 6 months ago
- ☆121Updated 6 months ago
- [ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow☆24Updated 8 months ago
- Official repo for: Epipolar Geometry Improves Video Generation Models☆69Updated 2 months ago
- Official code for the paper: Can3Tok (ICCV2025)☆40Updated 4 months ago
- UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding☆58Updated 4 months ago
- Public code for XFactor: Introduces the first geometry-free model to achieve true self-supervised / pose-free Novel View Synthesis (NVS) …☆75Updated 2 months ago
- Seeing World Dynamics in a Nutshell☆111Updated 9 months ago
- Official Implementation of paper "St4RTrack: Simultaneous 4D Reconstruction and Tracking in the World"☆96Updated 3 months ago
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes☆105Updated 9 months ago
- ☆104Updated last year
- An open source Multi-View Latent Diffusion Model☆40Updated 7 months ago
- [CVPR 2024 Hightlight] Code release for "The More You See in 2D, the More You Perceive in 3D"☆64Updated last year
- Source code for paper GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking☆56Updated 11 months ago
- [arXiv'25]🌈 Unseen 3D Geometry Reasoning from a Single Image.☆73Updated 5 months ago
- ☆32Updated 2 years ago
- [ICCV 2025] This is the official implementation of POMATO: Marrying Pointmap Matching with Temporal Motions for Dynamic 3D Reconstruction☆116Updated 4 months ago
- Official code for VIST3A: Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator☆81Updated 2 months ago
- [NeurIPS 2025] PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation☆90Updated 3 weeks ago
- Official Implementation of VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Jo…☆22Updated 6 months ago
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti…☆79Updated last year
- Official Code Release of NeurIPS 2025 Paper: HoloScene: Simulation‑Ready Interactive 3D Worlds from a Single Video☆78Updated 2 months ago
- 📷 Camera-controlled text-to-video generation, now with intrinsics, distortion and orientation control!☆90Updated last week
- StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams☆68Updated 6 months ago
- MEt3R: Measuring Multi-View Consistency in Generated Images☆145Updated 5 months ago
- Official code for paper: F3D-Gaus: Feed-forward 3D-aware Generation on ImageNet with Cycle-Aggregative Gaussian Splatting☆50Updated 9 months ago
- Self-reimplemented version of 4D-LRM.☆63Updated 6 months ago
- ☆35Updated 7 months ago
- Official implementation of "E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models"☆103Updated 6 months ago
- ☆87Updated 6 months ago