shiyoung77 / OVIR-3D
This is the official repository for OVIR-3D: Open-Vocabulary 3D Instance Retrieval Without Training on 3D Data. (CoRL'23)
☆104 · Updated last year
Alternatives and similar repositories for OVIR-3D:
Users who are interested in OVIR-3D are comparing it to the repositories listed below.
- ☆206 · Updated last year
- SceneFun3D ToolKit ☆123 · Updated this week
- Teaching robots to respond to open-vocab queries with CLIP and NeRF-like neural fields ☆163 · Updated last year
- Code release for ConceptFusion [RSS 2023] ☆203 · Updated last year
- [CoRL2023] Open-Vocabulary Scene-Graph ☆64 · Updated last year
- [ECCV 2024] ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation ☆215 · Updated 4 months ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding ☆85 · Updated last month
- Code repository for the Habitat Synthetic Scenes Dataset (HSSD) paper. ☆79 · Updated 9 months ago
- [ICRA 2024 Oral] Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation ☆114 · Updated 6 months ago
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding ☆89 · Updated 3 months ago
- Code for "Robot See Robot Do" presented at CoRL 2024! ☆100 · Updated 3 months ago
- [ICLR 2024] AGILE3D: Attention Guided Interactive Multi-object 3D Segmentation ☆107 · Updated last week
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning ☆64 · Updated 5 months ago
- ☆84 · Updated last week
- Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding" ☆231 · Updated 3 weeks ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation ☆132 · Updated last week
- [ICCV 2023] SGAligner: 3D Scene Alignment with Scene Graphs ☆91 · Updated 3 months ago
- [ECCV 2024] Omni6DPose: A Benchmark and Model for Universal 6D Object Pose Estimation and Tracking ☆77 · Updated 6 months ago
- [ECCV'24] OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation ☆178 · Updated 4 months ago
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries" ☆73 · Updated 7 months ago
- [NeurIPS'24] Large Spatial Model: End-to-end Unposed Images to Semantic 3D ☆147 · Updated this week
- [ICRA, 2025] SplatSim: Zero-Shot Sim2Real Transfer of RGB Manipulation Policies Using Gaussian Splatting ☆85 · Updated last month
- Splat-MOVER: Multi-Stage, Open-Vocabulary Robotic Manipulation via Editable Gaussian Splatting ☆30 · Updated 5 months ago
- ☆81 · Updated 5 months ago
- [CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding ☆50 · Updated this week
- [RAL 2024] OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding ☆24 · Updated 3 weeks ago
- [RA-L 2024] GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping ☆124 · Updated 8 months ago
- Code for the paper: "ODIN: A Single Model for 2D and 3D Segmentation" (CVPR 2024) ☆141 · Updated last week
- [CVPR 2024, Highlight] Living Scenes: Multi-object Relocalization and Reconstruction in Changing 3D Environments ☆89 · Updated 8 months ago