sosppxo/mvggt

sosppxo / mvggt

[CVPR 2026] This repository is the official implementation of MVGGT: Multimodal Visual Geometry Grounded Transformer for Multiview 3D Referring Expression Segmentation

☆88

Alternatives and similar repositories for mvggt

Users who are interested in mvggt compare it with the repositories listed below

kaist-ami / Uni-DVPS
[RA-L'24, IROS'24] Official PyTorch Implementation of "Uni-DVPS: Unified Model for Depth-Aware Video Panoptic Segmentation"
☆13 · Oct 11, 2024 · Updated last year
aliy98 / slope_constrained_planner
A global path planner for quadruped robots that treats the slope of the terrain as a constraint
☆19 · Feb 13, 2026 · Updated 3 weeks ago
zju3dv / blink_sim
Code for "BlinkVision: A Benchmark for Optical Flow, Scene Flow and Point Tracking Estimation using RGB Frames and Events", ECCV 2024 and…
☆20 · Feb 13, 2025 · Updated last year
SelfAI-research / AnywhereVLA
Repository for the "AnywhereVLA: Language-Conditioned Exploration and Mobile Manipulation" paper
☆20 · Oct 25, 2025 · Updated 4 months ago
wuzirui / pvsm
Official code release for the PVSM paper: "From Rays to Projections: Better Inputs for Feed-Forward View Synthesis"
☆40 · Jan 9, 2026 · Updated 2 months ago
Kidrauh / flow3r
☆54 · Feb 27, 2026 · Updated last week
lifuguan / LangSurf
[Arxiv'24] LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding
☆43 · Aug 18, 2025 · Updated 6 months ago
esw0116 / OmniSplat
[CVPR 2025] OmniSplat: Taming Feed-Forward 3D Gaussian Splatting for Omnidirectional Images with Editable Capabilities
☆34 · Jun 6, 2025 · Updated 9 months ago
HITSZ-NRSL / Terrain-aware-planning
☆39 · May 27, 2025 · Updated 9 months ago
Jho-Yonsei / SwiftVGGT
[CVPR 2026 Findings] SwiftVGGT: A Scalable Visual Geometry Grounded Transformer for Large-Scale Scenes
☆57 · Nov 25, 2025 · Updated 3 months ago
wrchen530 / batrack
[ICCV 2025 Oral] Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction (BA-Track)
☆95 · Nov 25, 2025 · Updated 3 months ago
juhyeon-kwon / Instruct-4DGS
[CVPR 2025] Instruct-4DGS: Efficient Dynamic Scene Editing via 4D Gaussian-based Static-Dynamic Separation
☆25 · Sep 21, 2025 · Updated 5 months ago
will-zzy / siggraph_asia
Open-source code for the first-place solution of the [SIGGRAPH Asia 2025 3DGS Challenge](https://gaplab.cuhk.edu.cn/projects/gsRaceSIGA20…
☆45 · Jan 28, 2026 · Updated last month
Ericcsr / synthesize_pregrasp
Contact planning for dexterous hand manipulation
☆19 · Jul 8, 2023 · Updated 2 years ago
j3soon / ros2-essentials
A repo containing essential ROS2 Humble features for controlling Autonomous Mobile Robots (AMRs) and robotic arm manipulators.
☆39 · Feb 26, 2026 · Updated last week
xingyoujun / transplat
(AAAI 2025) TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers
☆69 · Dec 18, 2024 · Updated last year
chenshi3 / UniSplat
[ICLR 2026] Official Implementation of "UniSplat: Unified Spatio-Temporal Fusion via 3D Latent Scaffolds for Dynamic Driving Scene Recons…
☆59 · Updated this week
Luo-Yihang / 4RC
4RC: 4D Reconstruction via Conditional Querying Anytime and Anywhere
☆89 · Feb 11, 2026 · Updated 3 weeks ago
SobhanGhayedzadeh / elevation-mapping-realsense-d435i
Elevation mapping with the RealSense D435i camera
☆40 · Apr 16, 2025 · Updated 10 months ago
shengliangd / StereoVLA
StereoVLA is powered by stereo vision and supports flexible deployment with high tolerance to camera pose variations.
☆53 · Jan 12, 2026 · Updated last month
ou524u / Less3Depend
[ICLR 2026] PyTorch implementation of "The Less You Depend, The More You Learn: Synthesizing Novel Views from Sparse, Unposed Images with…
☆52 · Jan 26, 2026 · Updated last month
aquastripe / DenseCLIP
An unofficial implementation of the paper "DenseCLIP: Extract Free Dense Labels from CLIP"
☆23 · Jan 27, 2022 · Updated 4 years ago
jzhzhang / Uni-NaVid
[RSS 2025] Uni-NaVid: A Video-based Vision-Language-Action Model for Unifying Embodied Navigation Tasks.
☆247 · Dec 15, 2025 · Updated 2 months ago
AIBluefisher / DeepGfM
Implementation of the NeurIPS 2025 paper: Gaussian from Motion: Exploring 3D Geometric Foundation Models for Gaussian Splatting
☆137 · Dec 3, 2025 · Updated 3 months ago
jacky121298 / WLST
[ICRA 2024] WLST: Weak Labels Guided Self-training for Weakly-supervised Domain Adaptation on 3D Object Detection
☆12 · Feb 6, 2024 · Updated 2 years ago
SJTU-DeepVisionLab / LaGa
Tackling View-Dependent Semantics in 3D Language Gaussian Splatting (ICML 2025)
☆62 · Jun 3, 2025 · Updated 9 months ago
Zhao-Jianing-SUDA / Hawkeye
The official implementation of our work Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanc…
☆12 · Oct 14, 2024 · Updated last year
UoY-RoboStar / AURO
ROS2 code for AURO practicals
☆11 · Jul 24, 2025 · Updated 7 months ago
ImperialCollegeLondon / Holo-SpoK
This project includes a Unity app and a set of ROS packages for controlling Boston Dynamics' Spot using Microsoft's HoloLens 2
☆13 · Oct 25, 2024 · Updated last year
kuai-lab / cvpr25_EditSplat
EditSplat: Multi-View Fusion & Attention-Guided Optimization for View-Consistent 3D Scene Editing (CVPR 2025) - Official PyTorch Code
☆50 · Dec 16, 2025 · Updated 2 months ago
leggedrobotics / foci
Fast, orientation-aware trajectory planning using a novel Gaussian overlap-based collision formulation, modeling both robot and environme…
☆47 · Jul 16, 2025 · Updated 7 months ago
TencentARC / Track4World
Track4World: Feedforward World-centric Dense 3D Tracking of All Pixels
☆69 · Updated this week
ahrs365 / bspline-lattice-planner
B-spline and lattice planner for trajectory planning
☆45 · Jan 23, 2026 · Updated last month
hustvl / 4DLangVGGT
Official implementation of “4D LangVGGT: 4D Language-Visual Geometry Grounded Transformer”
☆82 · Dec 10, 2025 · Updated 2 months ago
facebookresearch / visual_inertial_bundle_adjustment
Visual-inertial optimization of VIO trajectories and SLAM maps via accurate sensor modelling, with imu pre-integrated terms, full re-esti…
☆88 · Feb 19, 2026 · Updated 2 weeks ago
MrZihan / g3D-LF
Official implementation of "g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks" (CVPR'25).
☆45 · Jul 14, 2025 · Updated 7 months ago
kedarrajpathak / bimanual_teleoperation
ROS2 packages for a dual-arm Kinova robot setup, controlled using MoveIt Servo and ArUco pose estimation
☆10 · Jul 27, 2025 · Updated 7 months ago
gabfstr / DiffusionTrack
Fine-tuning and extending DiffusionDet to video and pedestrian multi-object tracking
☆13 · Apr 12, 2023 · Updated 2 years ago
LucasG2001 / cartesian_impedance_control
ROS2 catestian_impedance_controller from PdZ
☆11 · Oct 22, 2025 · Updated 4 months ago