SOTAMak1r / GST
[ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction
☆31Updated 2 months ago
Alternatives and similar repositories for GST:
Users that are interested in GST are comparing it to the libraries listed below
- [CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video☆62Updated 2 weeks ago
- [ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow☆22Updated last week
- "VicaSplat: A Single Run is All You Need for 3D Gaussian Splatting and Camera Estimation from Unposed Video Frames"☆63Updated last month
- [CVPR 2025] Official code for the paper "SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis"☆70Updated last month
- open-sourced video dataset with dynamic scenes and camera movements annotation☆31Updated last week
- Source code for paper GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking☆40Updated 3 months ago
- Official code for paper: F3D-Gaus: Feed-forward 3D-aware Generation on ImageNet with Cycle-Aggregative Gaussian Splatting☆42Updated last month
- ☆16Updated 3 months ago
- ☆99Updated 8 months ago
- ☆33Updated 3 weeks ago
- (CVPR 2024) NViST: In the wild New View Synthesis from a Single Image with Transformers☆38Updated 6 months ago
- ☆58Updated 2 months ago
- Code for "Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views", CVPR 2025☆25Updated 2 weeks ago
- [Arxiv'24] LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding☆29Updated last month
- DELTA: Dense Efficient Long-range 3D Tracking for Any video (ICLR 2025)☆83Updated last week
- Official implementation of GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs.☆42Updated 4 months ago
- ☆24Updated last year
- A collection of object-compositional modeling by implicit neural representation.☆58Updated last year
- Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation☆89Updated 3 months ago
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes☆80Updated 3 weeks ago
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti…☆79Updated 7 months ago
- ☆27Updated 9 months ago
- MEt3R: Measuring Multi-View Consistency in Generated Images☆97Updated 2 weeks ago
- ☆39Updated 6 months ago
- [CVPR2025] MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model☆67Updated 2 weeks ago
- [ICLR 2025] Diffusion²: Dynamic 3D Content Generation via Score Composition of Video and Multi-view Diffusion Models☆46Updated last month
- [CVPR 2024 Hightlight] Code release for "The More You See in 2D, the More You Perceive in 3D"☆61Updated 6 months ago
- StableRecon: Making Video to 3D easy☆76Updated 5 months ago
- Official Pytorch Implement for "Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects", Neurips 2023☆25Updated last month
- ConDense backbone, weights, and evaluation code.☆32Updated 9 months ago