[CVPR 2026] This repository is the official implementation of MVGGT: Multimodal Visual Geometry Grounded Transformer for Multiview 3D Referring Expression Segmentation
☆111Mar 24, 2026Updated 3 weeks ago
Alternatives and similar repositories for mvggt
Users that are interested in mvggt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2025] OmniSplat: Taming Feed-Forward 3D Gaussian Splatting for Omnidirectional Images with Editable Capabilities☆36Jun 6, 2025Updated 10 months ago
- OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streams☆81Mar 15, 2026Updated last month
- daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently☆38Feb 4, 2026Updated 2 months ago
- FELA: Learning Fine-Grained Alignment for Aerial Vision-Dialog Navigation, AAAI 2025.☆38Dec 18, 2024Updated last year
- ☆70Feb 27, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [BMVC 2025] Occam’s LGS: An Efficient Approach for Language Gaussian Splatting☆62Nov 18, 2025Updated 5 months ago
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)☆12May 6, 2024Updated last year
- [RA-L'24, IROS'24] Official PyTorch Implementation of "Uni-DVPS: Unified Model for Depth-Aware Video Panoptic Segmentation"☆13Oct 11, 2024Updated last year
- ☆16Oct 5, 2023Updated 2 years ago
- [ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping☆44Nov 21, 2025Updated 4 months ago
- Next-Generation AI-Assisted Kernel Engineering for Multi-Chip Systems☆39Updated this week
- Code for "BlinkVision: A Benchmark for Optical Flow, Scene Flow and Point Tracking Estimation using RGB Frames and Events", ECCV 2024 and…☆20Feb 13, 2025Updated last year
- Tools to support the calibration of the inner virtual camera of Gazebo.☆11May 28, 2020Updated 5 years ago
- [ICCV 2025 Oral] Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction (BA-Track)☆99Nov 25, 2025Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Segment Anything with Deictic Prompting☆27May 13, 2025Updated 11 months ago
- [Arxiv'24] LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding☆44Aug 18, 2025Updated 8 months ago
- SplatSDF, ICRA 2026☆20Feb 21, 2026Updated last month
- ☆14Dec 16, 2021Updated 4 years ago
- 🔥 [CVPR24] COTR: Compact Occupancy TRansformer for Vision-based 3D Occupancy Prediction☆116Apr 8, 2024Updated 2 years ago
- A global path planner for quadruped robots which considers slope of the terrain as constraint☆19Feb 13, 2026Updated 2 months ago
- Official Implementation of "Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity (ICML 2024)"☆43Aug 28, 2024Updated last year
- Repository for the "AnywhereVLA: Language-Conditioned Exploration and Mobile Manipulation" paper☆23Oct 25, 2025Updated 5 months ago
- [CVPR 2026 Findings] SwiftVGGT: A Scalable Visual Geometry Grounded Transformer for Large-Scale Scenes☆63Nov 25, 2025Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ICCV 25] Official repository of "Collaborative Instance Object Navigation: Leveraging Uncertainty-Awareness to Minimize Human-Agent Dial…☆28Apr 1, 2026Updated 2 weeks ago
- Trajectory planning for a mobile robot on uneven terrain☆42Apr 11, 2026Updated last week
- [AAAI 2025] Offical implementation of "DrivingForward: Feed-forward 3D Gaussian Splatting for Driving Scene Reconstruction from Flexible …☆217Dec 21, 2024Updated last year
- [ICLR 2026] PyTorch implementation of "The Less You Depend, The More You Learn: Synthesizing Novel Views from Sparse, Unposed Images with…☆54Jan 26, 2026Updated 2 months ago
- [NeurIPS'24] Large Spatial Model: End-to-end Unposed Images to Semantic 3D☆229Feb 11, 2026Updated 2 months ago
- ☆12Dec 4, 2024Updated last year
- [RSS 2025] Uni-NaVid: A Video-based Vision-Language-Action Model for Unifying Embodied Navigation Tasks.☆275Dec 15, 2025Updated 4 months ago
- ☆12Apr 18, 2025Updated last year
- 4RC: 4D Reconstruction via Conditional Querying Anytime and Anywhere☆114Updated this week
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [ECCV 2024] 4D Contrastive Superflows are Dense 3D Representation Learners☆52Dec 4, 2025Updated 4 months ago
- CVPR 2025' Instruct-4DGS: Efficient Dynamic Scene Editing via 4D Gaussian-based Static-Dynamic Separation☆28Sep 21, 2025Updated 6 months ago
- [3DV 2026] Revisiting Depth Representations for Feed-Forward 3D Gaussian Splatting☆159Dec 9, 2025Updated 4 months ago
- An attempt on implementing 3D gaussian splatting☆42Jul 27, 2023Updated 2 years ago
- ☆12Mar 1, 2023Updated 3 years ago
- [CVPR'26] Official implementation of "Emergent Outlier View Rejection in Visual Geometry Grounded Transformers"☆178Feb 22, 2026Updated last month
- Official implementation of "Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation" (NeurIPS'25 Oral)☆81Dec 22, 2025Updated 3 months ago