SOTAMak1r / GST
[ICLR 2025] Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction
☆39 Updated last month
Alternatives and similar repositories for GST
Users who are interested in GST are comparing it to the repositories listed below
- UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation ☆126 Updated 3 months ago
- ☆95 Updated 2 months ago
- Official code for the paper: Can3Tok (ICCV2025) ☆36 Updated 3 weeks ago
- [ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow ☆23 Updated 5 months ago
- [ICCV 2025] Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis ☆84 Updated 3 weeks ago
- Seeing World Dynamics in a Nutshell ☆109 Updated 5 months ago
- ☆33 Updated 4 months ago
- Streaming 3D Reconstruction with Explicit Spatial Pointer Memory ☆142 Updated 2 months ago
- UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding ☆50 Updated 3 weeks ago
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes ☆100 Updated 5 months ago
- [ICCV 2025] This is the official implementation of POMATO: Marrying Pointmap Matching with Temporal Motions for Dynamic 3D Reconstruction ☆93 Updated last month
- ☆101 Updated last year
- An open source Multi-View Latent Diffusion Model ☆38 Updated 4 months ago
- official repository for SpatialVID ☆182 Updated this week
- ☆31 Updated last year
- Self-reimplemented version of 4D-LRM. ☆52 Updated 3 months ago
- MEt3R: Measuring Multi-View Consistency in Generated Images ☆131 Updated last month
- [CVPR 2024 Highlight] Code release for "The More You See in 2D, the More You Perceive in 3D" ☆63 Updated 11 months ago
- Source code for paper GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking ☆56 Updated 8 months ago
- The official implementation for "Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated Videos". ☆44 Updated 3 months ago
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti… ☆78 Updated last year
- open-sourced video dataset with dynamic scenes and camera movements annotation ☆73 Updated 4 months ago
- ☆81 Updated this week
- [EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation ☆21 Updated 3 months ago
- [CVPR 2025 Oral] FluidNexus: 3D Fluid Reconstruction and Prediction from a Single Video ☆44 Updated last week
- [CVPR2025] Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation ☆124 Updated 2 months ago
- Official code for paper: F3D-Gaus: Feed-forward 3D-aware Generation on ImageNet with Cycle-Aggregative Gaussian Splatting ☆45 Updated 6 months ago
- ☆75 Updated 3 months ago
- Code for Faster VGGT with Block-Sparse Global Attention ☆57 Updated this week
- StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams ☆62 Updated 3 months ago