shiyoung77 / OVIR-3DLinks
This is the official repository for OVIR-3D: Open-Vocabulary 3D Instance Retrieval Without Training on 3D Data. (CoRL'23)
☆106Updated last year
Alternatives and similar repositories for OVIR-3D
Users that are interested in OVIR-3D are comparing it to the libraries listed below
Sorting:
- Teaching robots to respond to open-vocab queries with CLIP and NeRF-like neural fields☆172Updated last year
- SceneFun3D ToolKit☆143Updated 2 months ago
- [CoRL2023] Open-Vocabulary Scene-Graph☆66Updated last year
- [CVPR 2025] Source codes for the paper "3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning"☆130Updated 2 weeks ago
- Code release for ConceptFusion [RSS 2023]☆211Updated last year
- Code repository for the Habitat Synthetic Scenes Dataset (HSSD) paper.☆94Updated last year
- ☆62Updated 3 weeks ago
- [ICRA 2024 Oral] Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation☆123Updated 10 months ago
- [ECCV 2024] ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation☆228Updated 3 months ago
- [ICLR 2025] 6D Object Pose Tracking in Internet Videos for Robotic Manipulation☆83Updated last week
- [arXiv 2025] GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scene☆57Updated this week
- ☆216Updated last year
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding☆107Updated last month
- [ECCV 2024] Omni6DPose: A Benchmark and Model for Universal 6D Object Pose Estimation and Tracking☆97Updated 9 months ago
- ☆110Updated 8 months ago
- ☆100Updated 3 months ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆93Updated 4 months ago
- ☆64Updated 5 months ago
- [RSS 2025] Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulation☆108Updated 3 weeks ago
- ☆163Updated 4 months ago
- This repository contains code to reproduce experimental results from our HM3D paper in NeurIPS 2021.☆167Updated 3 years ago
- Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding"☆245Updated 3 months ago
- [CVPR 2023] CoWs on Pasture: Baselines and Benchmarks for Language-Driven Zero-Shot Object Navigation☆136Updated last year
- ☆70Updated 3 weeks ago
- [CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding☆131Updated 2 months ago
- Unifying 2D and 3D Vision-Language Understanding☆86Updated 2 months ago
- Code for "Robot See Robot Do" presented at CoRL 2024!☆133Updated 7 months ago
- [RA-L 2025] Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation☆82Updated 2 months ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆157Updated last week
- [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model`☆113Updated 8 months ago