yukangcao / Awesome-4D-Spatial-IntelligenceLinks
A curated list of awesome papers for reconstructing 4D spatial intelligence from video. (arXiv 2507.21045)
☆327Updated last week
Alternatives and similar repositories for Awesome-4D-Spatial-Intelligence
Users that are interested in Awesome-4D-Spatial-Intelligence are comparing it to the libraries listed below
Sorting:
- A simple state update rule to enhance length generalization for CUT3R☆462Updated 3 weeks ago
- SpatialVID: A Large-Scale Video Dataset with Spatial Annotations☆392Updated this week
- [CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video☆196Updated 5 months ago
- Official Implementation of "Dens3R: A Foundation Model for 3D Geometry Prediction"☆346Updated 3 weeks ago
- OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling☆376Updated last week
- [SIGGRAPH Asia 2025 (ACM TOG)] AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views☆484Updated 2 months ago
- [ICCV 2025] Code for Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene Reconstruction☆153Updated 2 months ago
- [CVPR 2025] Code for "StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene G…☆120Updated 9 months ago
- [CVPR2025] Feat2GS: Probing Visual Foundation Models with Gaussian Splatting☆218Updated 3 months ago
- 🎞️ [NeurIPS'24] MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views☆285Updated 10 months ago
- Code for FastVGGT: Training-Free Acceleration of Visual Geometry Transformer☆548Updated last week
- Official implementation of “4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models” (CVPR 2025)☆148Updated 2 weeks ago
- ☆317Updated 10 months ago
- Official implementation of EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting☆48Updated 4 months ago
- [NeurIPS'24] Large Spatial Model: End-to-end Unposed Images to Semantic 3D☆218Updated 6 months ago
- Self-reimplemented version of Long-LRM.☆194Updated last week
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction☆281Updated last month
- [ICCV 2025] Aether: Geometric-Aware Unified World Modeling☆508Updated 3 months ago
- Pytorch Code for "LEGaussians: Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding"☆156Updated 11 months ago
- Code for Streaming 4D Visual Geometry Transformer☆671Updated 2 months ago
- Revisiting Depth Representations for Feed-Forward 3D Gaussian Splatting☆139Updated 3 weeks ago
- Stereo4D dataset and processing code☆269Updated last week
- Cameras as Relative Positional Encoding☆597Updated this week
- [ICCV 2025 Oral] SceneSplat - Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining☆250Updated last week
- A Survey on 3D Gaussian Splatting Applications: Segmentation, Editing, and Generation☆205Updated last week
- [NeurIPS 2024] MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting☆137Updated 5 months ago
- [ICCV 2025] A simple training-free approach adapting DUSt3R for dynamic scenes.☆461Updated 6 months ago
- [NeurIPS'2024]: DiffGS: Functional Gaussian Splatting Diffusion☆251Updated 6 months ago
- [NeurIPS 2025] ZPressor: Bottleneck-Aware Compression for Scalable Feed-Forward 3DGS☆134Updated 3 weeks ago
- [CVPR 2025] GenFusion: Closing the Loop between Reconstruction and Generation via Videos☆148Updated 6 months ago