wzzheng / StreamVGGTLinks
Code for Streaming 4D Visual Geometry Transformer
☆630Updated last month
Alternatives and similar repositories for StreamVGGT
Users that are interested in StreamVGGT are comparing it to the libraries listed below
Sorting:
- Code for FastVGGT: Training-Free Acceleration of Visual Geometry Transformer☆494Updated this week
- Code of π^3: Permutation-Equivariant Visual Geometry Learning☆1,236Updated 2 weeks ago
- Official implement of VGGT-Long☆539Updated 3 weeks ago
- [ICCV 2025] Aether: Geometric-Aware Unified World Modeling☆482Updated 2 months ago
- [SIGGRAPH Asia 2025 (ACM TOG)] AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views☆423Updated last month
- A curated list of awesome papers for reconstructing 4D spatial intelligence from video. (arXiv 2507.21045)☆304Updated last month
- Open source impl of **MV-DUSt3R+ Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds** from Meta Reality Labs. Project page …☆531Updated last week
- ☆339Updated 3 weeks ago
- [ICCV 2025] A simple training-free approach adapting DUSt3R for dynamic scenes.☆447Updated 5 months ago
- Official implemetation of the paper "Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting".☆218Updated last year
- PE3R: Perception-Efficient 3D Reconstruction. Take 2 - 3 photos with your phone, upload them, wait a few minutes, and then start explorin…☆381Updated 5 months ago
- [ICCV 2025] Code for Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene Reconstruction☆153Updated last month
- [NeurIPS'24] Large Spatial Model: End-to-end Unposed Images to Semantic 3D☆213Updated 5 months ago
- Cameras as Relative Positional Encoding☆577Updated last week
- [NeurIPS 2024] SCube: Instant Large-Scale Scene Reconstruction using VoxSplats☆491Updated 2 months ago
- [ECCV2024] [3DV Nectar 2025] FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally☆235Updated last year
- SpatialVID: A Large-Scale Video Dataset with Spatial Annotations☆355Updated this week
- [ICCV 2025 Oral] SceneSplat - Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining☆203Updated 2 weeks ago
- [CVPR 2025 Highlight] Align3R: Aligned Monocular Depth Estimation for Dynamic Videos☆436Updated 5 months ago
- OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling☆348Updated this week
- [ICLR' 25] SplatFormer: Point Transformer for Robust 3D Gaussian Splatting☆363Updated 6 months ago
- Official Implementation of "Dens3R: A Foundation Model for 3D Geometry Prediction"☆332Updated 3 weeks ago
- TAPIP3D: Tracking Any Point in Persistent 3D Geometry☆285Updated 4 months ago
- [ICLR2025] A PyTorch implementation for STORM: Spatiotemporal Reconstruction Model for Large-Scale Outdoor Scenes☆244Updated 4 months ago
- [CVPR 2024 Highlight] Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields☆574Updated 11 months ago
- Official implementation of “4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models” (CVPR 2025)☆138Updated 5 months ago
- Pytorch Code for "LEGaussians: Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding"☆152Updated 10 months ago
- 🎞️ [NeurIPS'24] MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views☆278Updated 9 months ago
- [ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy☆768Updated last week
- [ICLR 2025 Oral] Official code for "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias"☆418Updated last month