yukangcao / Awesome-4D-Spatial-IntelligenceLinks
A curated list of awesome papers for reconstructing 4D spatial intelligence from video. (arXiv 2507.21045)
☆360Updated last week
Alternatives and similar repositories for Awesome-4D-Spatial-Intelligence
Users that are interested in Awesome-4D-Spatial-Intelligence are comparing it to the libraries listed below
Sorting:
- OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling☆387Updated this week
- SpatialVID: A Large-Scale Video Dataset with Spatial Annotations☆410Updated this week
- A simple state update rule to enhance length generalization for CUT3R☆504Updated last month
- [CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video☆200Updated 5 months ago
- IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction☆208Updated 2 weeks ago
- [ICCV 2025] Code for Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene Reconstruction☆156Updated 2 months ago
- [ICCV 2025 Oral] SceneSplat - Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining☆266Updated 3 weeks ago
- [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling☆527Updated 3 weeks ago
- Code for FastVGGT: Training-Free Acceleration of Visual Geometry Transformer☆590Updated last month
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction☆293Updated 2 months ago
- Trace Anything: Representing Any Video in 4D via Trajectory Fields☆381Updated 2 weeks ago
- Official implementation of “4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models” (CVPR 2025)☆155Updated last month
- Code for Streaming 4D Visual Geometry Transformer☆707Updated 2 weeks ago
- Official Implementation of "Dens3R: A Foundation Model for 3D Geometry Prediction"☆352Updated last month
- 🎞️ [NeurIPS'24] MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views☆288Updated 11 months ago
- [CVPR2025] Feat2GS: Probing Visual Foundation Models with Gaussian Splatting☆221Updated 3 months ago
- [CVPR 2025] GenFusion: Closing the Loop between Reconstruction and Generation via Videos☆150Updated 6 months ago
- [NeurIPS'24] Large Spatial Model: End-to-end Unposed Images to Semantic 3D☆221Updated 2 weeks ago
- A Survey on 3D Gaussian Splatting Applications: Segmentation, Editing, and Generation☆225Updated 3 weeks ago
- [CVPR 2025] Code for "StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene G…☆122Updated 10 months ago
- Official implementation of EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting☆49Updated 4 months ago
- [SIGGRAPH Asia 2025 (ACM TOG)] AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views☆534Updated 3 months ago
- [NeurIPS 2025] Streaming 3D Reconstruction with Explicit Spatial Pointer Memory☆160Updated last month
- [ICCV 2025] A simple training-free approach adapting DUSt3R for dynamic scenes.☆473Updated 7 months ago
- Pytorch Code for "LEGaussians: Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding"☆155Updated 11 months ago
- Cameras as Relative Positional Encoding☆607Updated 3 weeks ago
- Official implementation of CVPR25 paper "Decompositional Neural Scene Reconstruction with Generative Diffusion Prior"☆100Updated 7 months ago
- Stereo4D dataset and processing code☆273Updated last week
- "VicaSplat: A Single Run is All You Need for 3D Gaussian Splatting and Camera Estimation from Unposed Video Frames"☆87Updated 4 months ago
- [NeurIPS 2025 (Spotlight)] The implementation for the paper "4DGT Learning a 4D Gaussian Transformer Using Real-World Monocular Videos"☆317Updated last month