sosppxo / mvggt
This repository is the official implementation of MVGGT: Multimodal Visual Geometry Grounded Transformer for Multiview 3D Referring Expression Segmentation
☆48 Updated last week
Alternatives and similar repositories for mvggt
Users who are interested in mvggt are comparing it to the repositories listed below.
- Project page for Neural Shell Texture Splatting (ICCV 2025) ☆31 Updated 3 months ago
- Code implementation of Pi-Long ☆160 Updated last month
- [ICCV 2025] IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation ☆61 Updated 5 months ago
- 🔥TRACE in PyTorch (ICCV 2025) ☆22 Updated 2 months ago
- Evo-0: Vision-Language-Action Model with Implicit Spatial Understanding. ☆52 Updated last month
- OmniVGGT: Omni-Modality Driven Visual Geometry Grounded Transformer ☆261 Updated last week
- ☆43 Updated 5 months ago
- Official implementation of IROS 2025 paper Pseudo Depth Meets Gaussian: A Feed-forward RGB SLAM Baseline ☆47 Updated 5 months ago
- ☆79 Updated 5 months ago
- [ICCV 2025] GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scene ☆159 Updated 2 weeks ago
- The official implementation of InfiniteVGGT ☆245 Updated this week
- 4D-VLA: Spatiotemporal Vision-Language-Action Pretraining with Cross-Scene Calibration. Accepted to NeurIPS 2025. ☆47 Updated last week
- [IROS 2024] Incrementally Building Room-Scale Language-Embedded Gaussian Splats (LEGS) with a Mobile Robot ☆59 Updated 8 months ago
- A curated list of awesome exploration policy papers. ☆13 Updated 2 weeks ago
- Unified 3D Reconstruction and Semantic Understanding via Generalizable Gaussian Splatting from Unposed Multi-View Images ☆115 Updated 4 months ago
- [WACV2025] Linking Omni-Depth with View Synthesis through Multi-Sphere Image aided Generalizable Neural Radiance Field ☆15 Updated last year
- [NeurIPS2024] Multiview Scene Graph (topologically representing a scene from unposed images by interconnected place and object nodes) ☆123 Updated 3 months ago
- Release repository of our work "Co-Me: Confidence-Guided Token Merging for Visual Geometric Transformers" ☆148 Updated 2 months ago
- [CVPR 2024] GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction ☆79 Updated 6 months ago
- [ICCV2025] Adversarial Exploitation of Data Diversity Improves Visual Localization ☆47 Updated last month
- ☆27 Updated 3 months ago
- Official implementation of "Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation" (NeurIPS'25 Oral) ☆70 Updated last month
- Official implementation of Video-DPM ☆56 Updated last week
- StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams ☆72 Updated 7 months ago
- [RSS 2025] GauSS-MI: Gaussian Splatting Shannon Mutual Information for Active 3D Reconstruction ☆82 Updated 3 months ago
- This is a list of relevant papers for 3D Geometric Foundation Models and Applications. ☆112 Updated last week
- Official implementation of CVPR25 paper "Decompositional Neural Scene Reconstruction with Generative Diffusion Prior" ☆105 Updated 9 months ago
- ONNX models of VGGT ☆56 Updated 6 months ago
- CoSurfGS: Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction ☆64 Updated last year
- [ICCV2025] Extrapolated Urban View Synthesis Benchmark ☆47 Updated 3 months ago