iris0329/SeeGround

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/iris0329/SeeGround)

iris0329 / SeeGround

[CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding

☆222

Alternatives and similar repositories for SeeGround

Users that are interested in SeeGround are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JiawLin / SeqVLM
View on GitHub
[ACMMM 2025] Official implementation of SeqVLM: Proposal-Guided Multi-View Sequences Reasoning via VLM for Zero Shot 3D Visual Grounding
☆24Nov 25, 2025Updated 7 months ago
CurryYuan / ZSVG3D
View on GitHub
[CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding
☆63Aug 3, 2024Updated last year
TeleeMa / SADE
View on GitHub
An Examination of the Compositionality of Large Generative Vision-Language Models
☆19Apr 9, 2024Updated 2 years ago
worldbench / 3EED
View on GitHub
[NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3D
☆212Dec 26, 2025Updated 6 months ago
WHB139426 / TAB-Agent
View on GitHub
Think, Act, Build: An Agentic Framework with Vision Language Models for Zero-Shot 3D Visual Grounding
☆24Apr 5, 2026Updated 3 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
TeleeMa / Sigma-Agent
View on GitHub
This is the official repo for [CoRL 2024] Contrastive Imitation Learning for Language-guided Multi-Task Robotic Manipulation
☆32Oct 30, 2024Updated last year
liudaizong / Awesome-3D-Visual-Grounding
View on GitHub
😎 up-to-date & curated list of awesome 3D Visual Grounding papers, methods & resources.
☆282Jan 14, 2026Updated 6 months ago
ZCMax / ScanReason
View on GitHub
[ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities
☆85Oct 10, 2024Updated last year
InternRobotics / VLM-Grounder
View on GitHub
[CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding
☆134May 22, 2025Updated last year
ZhenyangLiu / ReasonGrounder
View on GitHub
☆15Jul 11, 2025Updated last year
xuxw98 / ESAM
View on GitHub
[ICLR 2025, Oral] EmbodiedSAM: Online Segment Any 3D Thing in Real Time
☆634May 7, 2025Updated last year
WHU-USI3DV / CityAnchor
View on GitHub
[ICLR'25] City-scale 3D Visual Grounding with Multi-modality LLMs
☆76Apr 10, 2026Updated 3 months ago
MTU3D / MTU3D
View on GitHub
☆266Aug 6, 2025Updated 11 months ago
tev-fbk / fun3du
View on GitHub
[CVPR25 Highlight] Official implementation of Fun3DU, a method for functional understanding and segmentation in 3D scenes
☆50Sep 30, 2025Updated 9 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
lslrh / DMA
View on GitHub
Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024
☆32Jul 18, 2024Updated 2 years ago
hovsg / HOV-SG
View on GitHub
[RSS2024] Official implementation of "Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation"
☆513Jan 19, 2026Updated 6 months ago
MrZihan / g3D-LF
View on GitHub
Official implementation of "g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks" (CVPR'25).
☆56Jul 14, 2025Updated last year
yjtang249 / OnlineAnySeg
View on GitHub
[CVPR 2025] OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D Mask Merging
☆45Jun 6, 2025Updated last year
GWxuan / TSP3D
View on GitHub
[CVPR 2025, All Strong Accept] TSP3D: Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding
☆252Jun 11, 2025Updated last year
THU-SI / Spatial-MLLM
View on GitHub
[NeurIPS 2025 Spotlight] Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence
☆479Feb 5, 2026Updated 5 months ago
InternRobotics / EmbodiedScan
View on GitHub
[CVPR 2024 & NeurIPS 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
☆672Jun 13, 2025Updated last year
pengsongyou / openscene
View on GitHub
[CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies
☆840Oct 27, 2023Updated 2 years ago
XiaohanLei / GaussNav
View on GitHub
PyTorch implementation of paper: GaussNav: Gaussian Splatting for Visual Navigation
☆219Nov 11, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
AIGeeksGroup / 3D-R1
View on GitHub
3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding
☆414Updated this week
VinAIResearch / Open3DIS
View on GitHub
Open3DIS: Open-vocabulary 3D Instance Segmentation with 2D Mask Guidance (CVPR 2024)
☆135Nov 12, 2024Updated last year
ZCMax / LLaVA-3D
View on GitHub
[ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World
☆384Oct 21, 2025Updated 8 months ago
boschresearch / Open3DSG
View on GitHub
[CVPR 2024] Open3DSG: Open-Vocabulary 3D Scene Graphs from Point Clouds with Queryable Objects and Open-Set Relationships
☆166Sep 16, 2024Updated last year
ZzZZCHS / Chat-Scene
View on GitHub
[NeurIPS 2024 & TPAMI 2026] Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers
☆216Apr 12, 2026Updated 3 months ago
wzzheng / StreamVGGT
View on GitHub
[ICLR 2026] Streaming 4D Visual Geometry Transformer
☆941Oct 27, 2025Updated 8 months ago
Pointcept / OpenIns3D
View on GitHub
[ECCV'24] OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation
☆207Oct 19, 2024Updated last year
Open3DA / LL3DA
View on GitHub
[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Langu…
☆319Jul 17, 2024Updated 2 years ago
WangXihan-bit / GaussianGraph
View on GitHub
☆56Mar 14, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
unique1i / SceneSplat
View on GitHub
[ICCV 2025 Oral] SceneSplat - Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining
☆354May 25, 2026Updated last month
BJHYZJ / DovSG
View on GitHub
[RA-L 2025] Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation
☆163Apr 17, 2025Updated last year
Zeying-Gong / ascent
View on GitHub
[RAL‘26] Stairway to Success: An Online Floor-Aware Zero-Shot Object-Goal Navigation Framework via LLM-Driven Coarse-to-Fine Exploration
☆139Jan 11, 2026Updated 6 months ago
jiaming-zhou / Zero-WAM
View on GitHub
Zero-WAM, an in-context world model for zero-shot robotic task generalization
☆31Jul 8, 2026Updated last week
facebookresearch / univlg
View on GitHub
Unifying 2D and 3D Vision-Language Understanding
☆126Jul 2, 2026Updated 2 weeks ago
ActiveVisionLab / Awesome-LLM-3D
View on GitHub
Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources
☆2,238Apr 16, 2026Updated 3 months ago
HaoyiZhu / SPA
View on GitHub
[ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation
☆177Jun 19, 2025Updated last year