xuxw98/ESAM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xuxw98/ESAM)

xuxw98 / ESAM

[ICLR 2025, Oral] EmbodiedSAM: Online Segment Any 3D Thing in Real Time

☆634

Alternatives and similar repositories for ESAM

Users that are interested in ESAM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

xuxw98 / Online3D
View on GitHub
[CVPR 2024] Memory-based Adapters for Online 3D Scene Perception
☆124Mar 25, 2025Updated last year
xuxw98 / DSPDet3D
View on GitHub
[ECCV 2024] 3D Small Object Detection with Dynamic Spatial Pruning
☆116Aug 19, 2024Updated last year
bagh2178 / SG-Nav
View on GitHub
[NeurIPS 2024] SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation
☆346Sep 16, 2025Updated 10 months ago
GAP-LAB-CUHK-SZ / SAMPro3D
View on GitHub
SAMPro3D: Locating SAM Prompts in 3D for Zero-Shot Instance Segmentation (3DV 2025)
☆171Apr 17, 2025Updated last year
wyf-ACCEPT / BackToReality
View on GitHub
[CVPR 2022] Back to Reality: Weakly-supervised 3D Object Detection with Shape-guided Label Enhancement
☆44Mar 5, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
yjtang249 / OnlineAnySeg
View on GitHub
[CVPR 2025] OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D Mask Merging
☆46Jun 6, 2025Updated last year
InternRobotics / EmbodiedScan
View on GitHub
[CVPR 2024 & NeurIPS 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
☆672Jun 13, 2025Updated last year
GWxuan / TSP3D
View on GitHub
[CVPR 2025, All Strong Accept] TSP3D: Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding
☆252Jun 11, 2025Updated last year
ZiyuGuo99 / SAM2Point
View on GitHub
The Most Faithful Implementation of Segment Anything (SAM) in 3D
☆359Sep 11, 2024Updated last year
Pointcept / OpenIns3D
View on GitHub
[ECCV'24] OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation
☆207Oct 19, 2024Updated last year
dk-liang / UniSeg3D
View on GitHub
[NeurIPS 2024] A Unified Framework for 3D Scene Understanding
☆179Jul 7, 2025Updated last year
concept-graphs / concept-graphs
View on GitHub
Official code release for ConceptGraphs
☆908Oct 16, 2025Updated 9 months ago
MTU3D / MTU3D
View on GitHub
☆266Aug 6, 2025Updated 11 months ago
ant-research / DepthLab
View on GitHub
Official implementation of "DepthLab: From Partial to Complete"
☆551Feb 14, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Pointcept / SegmentAnything3D
View on GitHub
[ICCV'23 Workshop] SAM3D: Segment Anything in 3D Scenes
☆1,380Apr 21, 2024Updated 2 years ago
bagh2178 / UniGoal
View on GitHub
[CVPR 2025] UniGoal: Towards Universal Zero-shot Goal-oriented Navigation
☆347Sep 16, 2025Updated 10 months ago
wzzheng / StreamVGGT
View on GitHub
[ICLR 2026] Streaming 4D Visual Geometry Transformer
☆943Oct 27, 2025Updated 8 months ago
HKUST-Aerial-Robotics / FM-Fusion
View on GitHub
[RA-L] FM-Fusion: Instance-aware Semantic Mapping Boosted by Vision-Language Foundation Models
☆153Sep 28, 2025Updated 9 months ago
concept-fusion / concept-fusion
View on GitHub
Code release for ConceptFusion [RSS 2023]
☆241Sep 23, 2023Updated 2 years ago
MIT-SPARK / Clio
View on GitHub
☆248Sep 1, 2025Updated 10 months ago
Mark12Ding / SAM2Long
View on GitHub
[ICCV 2025] SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree
☆568Jul 29, 2025Updated 11 months ago
facebookresearch / univlg
View on GitHub
Unifying 2D and 3D Vision-Language Understanding
☆126Jul 2, 2026Updated 3 weeks ago
GuanxingLu / ManiGaussian
View on GitHub
[ECCV 2024] ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation
☆282Mar 29, 2026Updated 3 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
minghanqin / LangSplat
View on GitHub
Official implementation of the paper "LangSplat: 3D Language Gaussian Splatting" [CVPR2024 Highlight]
☆1,068Oct 10, 2025Updated 9 months ago
UMass-Embodied-AGI / TesserAct
View on GitHub
ICCV 2025 | TesserAct: Learning 4D Embodied World Models
☆402Aug 4, 2025Updated 11 months ago
HengyiWang / spann3r
View on GitHub
[3DV'25 Award Candidate] 3D Reconstruction with Spatial Memory
☆1,139Feb 25, 2025Updated last year
CUT3R / CUT3R
View on GitHub
Official implementation of Continuous 3D Perception Model with Persistent State
☆1,468Aug 27, 2025Updated 10 months ago
OpenMask3D / openmask3d
View on GitHub
☆265Dec 15, 2023Updated 2 years ago
hovsg / HOV-SG
View on GitHub
[RSS2024] Official implementation of "Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation"
☆513Jan 19, 2026Updated 6 months ago
yd-yin / SAI3D
View on GitHub
[CVPR 2024] SAI3D: Segment Any Instance in 3D Scenes
☆162Mar 29, 2024Updated 2 years ago
ZCMax / LLaVA-3D
View on GitHub
[ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World
☆387Oct 21, 2025Updated 9 months ago
yyfz / Pi3
View on GitHub
[ICLR 2026] π^3: Permutation-Equivariant Visual Geometry Learning
☆2,084Jul 3, 2026Updated 3 weeks ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Eku127 / DualMap
View on GitHub
[RAL-25] An online open-vocabulary mapping system that enables natural language querying to navigate dynamic scenes, with ROS support.
☆203Jan 1, 2026Updated 6 months ago
facebookresearch / fast3r
View on GitHub
[CVPR 2025] Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
☆1,597May 7, 2025Updated last year
PKU-EPIC / MaskClustering
View on GitHub
[CVPR 24] MaskClustering: View Consensus based Mask Graph Clustering for Open-Vocabulary 3D Instance Segmentation
☆129Apr 25, 2024Updated 2 years ago
vision-x-nyu / thinking-in-space
View on GitHub
Official repo and evaluation implementation of VSI-Bench
☆734Aug 5, 2025Updated 11 months ago
iris0329 / SeeGround
View on GitHub
[CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding
☆222Apr 21, 2025Updated last year
HKUST-Aerial-Robotics / SG-Reg
View on GitHub
[T-RO 2025] SG-Reg: Generalizable and Efficient Scene Graph Registration
☆138Jul 20, 2025Updated last year
scene-verse / SceneVerse
View on GitHub
Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding"
☆288Mar 19, 2025Updated last year