SpatialVID: A Large-Scale Video Dataset with Spatial Annotations
☆502Feb 21, 2026Updated last week
Alternatives and similar repositories for SpatialVID
Users that are interested in SpatialVID are comparing it to the libraries listed below
Sorting:
- [ICLR 2026] π^3: Permutation-Equivariant Visual Geometry Learning☆1,642Jan 28, 2026Updated last month
- ☆703May 1, 2025Updated 10 months ago
- Code for Faster VGGT with Block-Sparse Global Attention☆89Nov 14, 2025Updated 3 months ago
- [CVPR 2025 Highlight] GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control☆1,271Sep 24, 2025Updated 5 months ago
- [ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling☆433Updated this week
- Code for ICCV'2025 (Best student paper honorable mention) "RayZer: A Self-supervised Large View Synthesis Model"☆411Nov 24, 2025Updated 3 months ago
- Official implementation of Continuous 3D Perception Model with Persistent State☆1,345Aug 27, 2025Updated 6 months ago
- [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling☆573Oct 26, 2025Updated 4 months ago
- Cameras as Relative Positional Encoding☆676Dec 18, 2025Updated 2 months ago
- [ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy☆913Sep 26, 2025Updated 5 months ago
- Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"☆1,232Jan 5, 2026Updated last month
- A simple state update rule to enhance length generalization for CUT3R☆586Oct 1, 2025Updated 5 months ago
- Official implement of VGGT-Long☆790Feb 9, 2026Updated 3 weeks ago
- [ICLR 2026] Streaming 4D Visual Geometry Transformer☆832Oct 27, 2025Updated 4 months ago
- [CVPR 2026] Official implementation of "MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second".☆405Updated this week
- Stereo4D dataset and processing code☆292Nov 4, 2025Updated 3 months ago
- [ICLR 2025 Oral] Official code for "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias"☆503Aug 4, 2025Updated 6 months ago
- Dynamic 3D Foundation Model using Causal Transformer. [ICLR 2026]☆316Feb 2, 2026Updated last month
- [NeurIPS 2025 (Spotlight)] The implementation for the paper "4DGT Learning a 4D Gaussian Transformer Using Real-World Monocular Videos"☆401Sep 19, 2025Updated 5 months ago
- Open source impl of **MV-DUSt3R+ Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds** from Meta Reality Labs. Project page …☆577Feb 23, 2026Updated last week
- [NeurIPS 2025] Streaming 3D Reconstruction with Explicit Spatial Pointer Memory☆179Sep 26, 2025Updated 5 months ago
- [ICCV 2025] GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors☆432Oct 2, 2025Updated 5 months ago
- [CVPR 2025 Highlight] Align3R: Aligned Monocular Depth Estimation for Dynamic Videos☆455Apr 4, 2025Updated 10 months ago
- [CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision☆2,312Nov 2, 2025Updated 4 months ago
- [ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video☆1,748Nov 28, 2025Updated 3 months ago
- Stable Virtual Camera: Generative View Synthesis with Diffusion Models☆1,561Jun 5, 2025Updated 8 months ago
- [SIGGRAPH Asia'24 & TOG] Gaussian Opacity Fields: Efficient Adaptive Surface Reconstruction in Unbounded Scenes☆983Nov 15, 2024Updated last year
- Code for MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data (CVPR 2025)☆199May 20, 2025Updated 9 months ago
- ViPE: Video Pose Engine for Geometric 3D Perception☆1,731Jan 1, 2026Updated 2 months ago
- News: the 10k dataset is ready for download.☆573Feb 10, 2026Updated 2 weeks ago
- [CVPR 2025] Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass☆1,510May 7, 2025Updated 9 months ago
- MapAnything: Universal Feed-Forward Metric 3D Reconstruction☆2,915Jan 18, 2026Updated last month
- [ICLR 2026] Trace Anything: Representing Any Video in 4D via Trajectory Fields☆509Oct 31, 2025Updated 4 months ago
- [CVPR 2025] GenFusion: Closing the Loop between Reconstruction and Generation via Videos☆162Apr 22, 2025Updated 10 months ago
- [ICCV 2025 Highlight] Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction☆412Jun 6, 2025Updated 8 months ago
- [ICLR'25 Oral] No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images☆930Updated this week
- LATTICE: Democratize High-Fidelity 3D Generation at Scale (CVPR'26)☆224Updated this week
- [SIGGRAPH Asia 2025 (ACM TOG)] AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views☆736Dec 22, 2025Updated 2 months ago
- [SIGGRAPH Asia 2024 (Journal Track)] StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal☆753Aug 2, 2025Updated 7 months ago