[CVPR 2026] This repository is the official implementation of MVGGT: Multimodal Visual Geometry Grounded Transformer for Multiview 3D Referring Expression Segmentation
☆100Mar 24, 2026Updated this week
Alternatives and similar repositories for mvggt
Users that are interested in mvggt are comparing it to the libraries listed below.
- [CVPR 2025] OmniSplat: Taming Feed-Forward 3D Gaussian Splatting for Omnidirectional Images with Editable Capabilities☆35Jun 6, 2025Updated 9 months ago
- OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streams☆55Mar 15, 2026Updated 2 weeks ago
- daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently☆34Feb 4, 2026Updated last month
- FELA: Learning Fine-Grained Alignment for Aerial Vision-Dialog Navigation, AAAI 2025.☆37Dec 18, 2024Updated last year
- ☆64Feb 27, 2026Updated last month
- [BMVC 2025] Occam’s LGS: An Efficient Approach for Language Gaussian Splatting☆61Nov 18, 2025Updated 4 months ago
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)☆12May 6, 2024Updated last year
- [RA-L'24, IROS'24] Official PyTorch Implementation of "Uni-DVPS: Unified Model for Depth-Aware Video Panoptic Segmentation"☆13Oct 11, 2024Updated last year
- ☆16Oct 5, 2023Updated 2 years ago
- [ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping☆41Nov 21, 2025Updated 4 months ago
- Code for "BlinkVision: A Benchmark for Optical Flow, Scene Flow and Point Tracking Estimation using RGB Frames and Events", ECCV 2024 and…☆20Feb 13, 2025Updated last year
- Room-across-Room (RxR) is a large-scale, multilingual dataset for Vision-and-Language Navigation (VLN) in Matterport3D environments. It c…☆176Jul 26, 2023Updated 2 years ago
- Official code release for the PVSM paper: "From Rays to Projections: Better Inputs for Feed-Forward View Synthesis"☆43Jan 9, 2026Updated 2 months ago
- [ICLR 2026] Official Implementation of "UniSplat: Unified Spatio-Temporal Fusion via 3D Latent Scaffolds for Dynamic Driving Scene Recons…☆65Mar 4, 2026Updated 3 weeks ago
- Open-source code for the first-place solution of the [SIGGRAPH Asia 2025 3DGS Challenge](https://gaplab.cuhk.edu.cn/projects/gsRaceSIGA20…☆53Jan 28, 2026Updated 2 months ago
- Tools to support the calibration of the inner virtual camera of Gazebo.☆10May 28, 2020Updated 5 years ago
- [ICCV 2025 Oral] Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction (BA-Track)☆97Nov 25, 2025Updated 4 months ago
- Segment Anything with Deictic Prompting☆27May 13, 2025Updated 10 months ago
- A global path planner for quadruped robots which considers slope of the terrain as constraint☆18Feb 13, 2026Updated last month
- [Arxiv'24] LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding☆44Aug 18, 2025Updated 7 months ago
- ☆12Oct 5, 2020Updated 5 years ago
- ☆14Dec 16, 2021Updated 4 years ago
- Official Implementation of "Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity (ICML 2024)"☆43Aug 28, 2024Updated last year
- Visual-inertial optimization of VIO trajectories and SLAM maps via accurate sensor modelling, with imu pre-integrated terms, full re-esti…☆92Mar 20, 2026Updated last week
- Repository for the "AnywhereVLA: Language-Conditioned Exploration and Mobile Manipulation" paper☆23Oct 25, 2025Updated 5 months ago
- [CVPR 2026 Findings] SwiftVGGT: A Scalable Visual Geometry Grounded Transformer for Large-Scale Scenes☆63Nov 25, 2025Updated 4 months ago
- [ICCV 25] Official repository of "Collaborative Instance Object Navigation: Leveraging Uncertainty-Awareness to Minimize Human-Agent Dial…☆25Dec 6, 2025Updated 3 months ago
- Official Codebase for our CVPR 2026 paper UniSH: Unifying Scene and Human Reconstruction in a Feed-Forward Pass☆137Feb 24, 2026Updated last month
- TensorFlow implementation of SimCSE, with baseline experiment comparisons☆13Jul 22, 2021Updated 4 years ago
- Code for our ACL 2023 paper: Causality-aware Concept Extraction based on Knowledge-guided Prompting☆14Aug 19, 2023Updated 2 years ago
- A temporary repo to share the DMBERT code for Event Detection☆13Apr 19, 2020Updated 5 years ago
- [AAAI 2025] Official implementation of "DrivingForward: Feed-forward 3D Gaussian Splatting for Driving Scene Reconstruction from Flexible …☆209Dec 21, 2024Updated last year
- [NeurIPS'24] Large Spatial Model: End-to-end Unposed Images to Semantic 3D☆228Feb 11, 2026Updated last month
- [ICLR 2026] PyTorch implementation of "The Less You Depend, The More You Learn: Synthesizing Novel Views from Sparse, Unposed Images with…☆53Jan 26, 2026Updated 2 months ago
- [RSS 2025] Uni-NaVid: A Video-based Vision-Language-Action Model for Unifying Embodied Navigation Tasks.☆257Dec 15, 2025Updated 3 months ago
- ☆12Dec 4, 2024Updated last year
- SimCSE☆15Oct 1, 2022Updated 3 years ago
- Edge deployment guide for InternNav-based perception and navigation on Unitree Go2 / Go2W / B2 robots (ROS 2, RealSense, Python).☆54Dec 9, 2025Updated 3 months ago
- [CVPR '26] SceneTok: A Compressed, Diffusable Token Space for 3D Scenes☆136Updated this week