yukangcao / Awesome-4D-Spatial-IntelligenceLinks
A curated list of awesome papers for reconstructing 4D spatial intelligence from video. (arXiv 2507.21045)
☆400Updated this week
Alternatives and similar repositories for Awesome-4D-Spatial-Intelligence
Users that are interested in Awesome-4D-Spatial-Intelligence are comparing it to the libraries listed below
Sorting:
- Official implementation of “4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models” (CVPR 2025)☆169Updated 2 months ago
- SpatialVID: A Large-Scale Video Dataset with Spatial Annotations☆455Updated 2 weeks ago
- OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling☆406Updated last week
- A simple state update rule to enhance length generalization for CUT3R☆545Updated 2 months ago
- IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction☆306Updated 3 weeks ago
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction☆314Updated 3 months ago
- [ICCV 2025] Code for Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene Reconstruction☆161Updated last week
- [CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video☆214Updated 7 months ago
- [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling☆554Updated 2 months ago
- Code for Streaming 4D Visual Geometry Transformer☆761Updated 2 months ago
- Trace Anything: Representing Any Video in 4D via Trajectory Fields☆447Updated last month
- Official Implementation of "Dens3R: A Foundation Model for 3D Geometry Prediction"☆361Updated 3 months ago
- [NeurIPS 2025 (Spotlight)] The implementation for the paper "4DGT Learning a 4D Gaussian Transformer Using Real-World Monocular Videos"☆369Updated 3 months ago
- A Survey on 3D Gaussian Splatting Applications: Segmentation, Editing, and Generation☆255Updated last week
- Official implementation of EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting☆51Updated 6 months ago
- [SIGGRAPH Asia 2025 (ACM TOG)] AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views☆651Updated last week
- [ICCV 2025 Oral] SceneSplat - Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining☆282Updated 2 weeks ago
- [ICCV 2025] A simple training-free approach adapting DUSt3R for dynamic scenes.☆496Updated 8 months ago
- [CVPR 2025] GenFusion: Closing the Loop between Reconstruction and Generation via Videos☆151Updated 8 months ago
- "E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training" official implementation.☆192Updated this week
- [NeurIPS 2025] Streaming 3D Reconstruction with Explicit Spatial Pointer Memory☆171Updated 3 months ago
- [NeurIPS'24] Large Spatial Model: End-to-end Unposed Images to Semantic 3D☆222Updated 2 months ago
- Code for FastVGGT: Training-Free Acceleration of Visual Geometry Transformer☆641Updated last month
- [CVPR2025] Feat2GS: Probing Visual Foundation Models with Gaussian Splatting☆225Updated 5 months ago
- Stereo4D dataset and processing code☆283Updated last month
- Official implementation of CVPR25 paper "Decompositional Neural Scene Reconstruction with Generative Diffusion Prior"☆105Updated 9 months ago
- Any4D: Unified Feed-Forward Metric 4D Reconstruction☆193Updated 2 weeks ago
- 🎞️ [NeurIPS'24] MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views☆300Updated last year
- [CVPR 2025] Code for "StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene G…☆122Updated 11 months ago
- [NeurIPS 2025] LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS☆165Updated 2 months ago