WU-CVGL / SIU3RLinks
[NeurIPS 2025 Spotlight] Official implementation of the SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment
☆124Updated last month
Alternatives and similar repositories for SIU3R
Users that are interested in SIU3R are comparing it to the libraries listed below
Sorting:
- UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding☆54Updated 2 months ago
- UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation☆133Updated 4 months ago
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆41Updated 2 months ago
- ☆111Updated 4 months ago
- [ICCV 2025] ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting☆59Updated 3 weeks ago
- [NeurIPS 2025] Streaming 3D Reconstruction with Explicit Spatial Pointer Memory☆153Updated last month
- [NeurIPS 25] TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels☆17Updated last month
- [Arxiv'24] LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding☆35Updated 2 months ago
- Public code for XFactor: Introduces the first geometry-free model to achieve true self-supervised / pose-free Novel View Synthesis (NVS) …☆22Updated last week
- "VicaSplat: A Single Run is All You Need for 3D Gaussian Splatting and Camera Estimation from Unposed Video Frames"☆86Updated 3 months ago
- ☆38Updated 7 months ago
- Official code for the paper: Can3Tok (ICCV2025)☆38Updated 2 months ago
- [ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow☆23Updated 6 months ago
- ☆16Updated 10 months ago
- Seeing World Dynamics in a Nutshell☆110Updated 7 months ago
- StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams☆65Updated 4 months ago
- (CVPR 2024) NViST: In the wild New View Synthesis from a Single Image with Transformers☆41Updated last year
- [EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation☆24Updated 4 months ago
- ☆84Updated this week
- Code for "Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views", CVPR 2025☆43Updated 3 months ago
- Code for Faster VGGT with Block-Sparse Global Attention☆82Updated last month
- The official implementation for "Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated Videos".☆48Updated 5 months ago
- Official Implementation of paper "St4RTrack: Simultaneous 4D Reconstruction and Tracking in the World"☆76Updated last month
- [ICCV 2025] This is the official implementation of POMATO: Marrying Pointmap Matching with Temporal Motions for Dynamic 3D Reconstruction☆106Updated 2 months ago
- ☆91Updated last month
- [NeurIPS 2025] PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation☆57Updated last week
- MEt3R: Measuring Multi-View Consistency in Generated Images☆138Updated 3 months ago
- Official implementation of "E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models"☆102Updated 4 months ago
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025.☆86Updated 3 weeks ago
- [CVPR 2025] Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields☆26Updated 2 weeks ago