WU-CVGL / SIU3RLinks
[NeurIPS 2025 Spotlight] Official implementation of the SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment
☆30Updated this week
Alternatives and similar repositories for SIU3R
Users that are interested in SIU3R are comparing it to the libraries listed below
Sorting:
- [ICCV 2025] ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting☆47Updated this week
- [NeurIPS 25] TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels☆16Updated this week
- ☆38Updated 6 months ago
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025.☆82Updated 2 months ago
- Streaming 3D Reconstruction with Explicit Spatial Pointer Memory☆144Updated 2 months ago
- StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams☆63Updated 3 months ago
- "VicaSplat: A Single Run is All You Need for 3D Gaussian Splatting and Camera Estimation from Unposed Video Frames"☆86Updated 2 months ago
- UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation☆127Updated 3 months ago
- [Arxiv'24] LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding☆32Updated last month
- ☆16Updated 8 months ago
- InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.☆164Updated last week
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆39Updated last month
- Official implementation of EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting☆46Updated 3 months ago
- Official Implementation of paper "St4RTrack: Simultaneous 4D Reconstruction and Tracking in the World"☆50Updated this week
- Official code for the paper: Can3Tok (ICCV2025)☆36Updated last month
- MEt3R: Measuring Multi-View Consistency in Generated Images☆131Updated 2 months ago
- ☆84Updated last week
- Seeing World Dynamics in a Nutshell☆109Updated 6 months ago
- UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding☆50Updated last month
- [CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video☆194Updated 3 months ago
- ☆95Updated 3 months ago
- DELTA: Dense Efficient Long-range 3D Tracking for Any video (ICLR 2025)☆123Updated 5 months ago
- The official implementation for "Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated Videos".☆44Updated 4 months ago
- SpatialVID: A Large-Scale Video Dataset with Spatial Annotations☆324Updated this week
- ☆66Updated 9 months ago
- [ICCV 2025] This is the official implementation of POMATO: Marrying Pointmap Matching with Temporal Motions for Dynamic 3D Reconstruction☆93Updated last month
- Code for Faster VGGT with Block-Sparse Global Attention☆72Updated this week
- OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling☆264Updated last week
- Code for "Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views", CVPR 2025☆43Updated 2 months ago
- Official implementation of CVPR25 paper "Decompositional Neural Scene Reconstruction with Generative Diffusion Prior"☆94Updated 5 months ago