BIT-DYN / OpenObj
[RAL 2024] OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding
☆27Updated 2 months ago
Alternatives and similar repositories for OpenObj:
Users that are interested in OpenObj are comparing it to the libraries listed below
- Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24).☆62Updated 2 months ago
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding☆53Updated 9 months ago
- [ICLR 2025] Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention☆19Updated 2 months ago
- ☆49Updated 7 months ago
- [CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding☆115Updated 2 weeks ago
- AIR-Embodied: An Efficient Active 3DGS-based Interaction and Reconstruction Framework with Embodied Large Language Model☆18Updated 3 weeks ago
- Official code for the CVPR 2025 paper "Navigation World Models".☆83Updated 3 weeks ago
- [NeurIPS 2024] Official code repository for MSR3D paper☆51Updated 2 weeks ago
- Splat-MOVER: Multi-Stage, Open-Vocabulary Robotic Manipulation via Editable Gaussian Splatting☆32Updated 7 months ago
- [CoRL2023] Open-Vocabulary Scene-Graph☆66Updated last year
- [ECCV24] Navigation Instruction Generation with BEV Perception and Large Language Models☆31Updated 9 months ago
- ☆59Updated 4 months ago
- [NeurIPS2024] Multiview Scene Graph (topologically representing a scene from unposed images by interconnected place and object nodes)☆111Updated 4 months ago
- PyTorch implementation of CVPR 2024 paper: Instance-aware Exploration-Verification-Exploitation for Instance ImageGoal Navigation☆26Updated 6 months ago
- ☆57Updated last month
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding☆99Updated this week
- [CVPR 24] MaskClustering: View Consensus based Mask Graph Clustering for Open-Vocabulary 3D Instance Segmentation☆98Updated last year
- Open-source implementations on real robots☆32Updated 5 months ago
- [RA-L] Lost & Found dynamically tracks object poses from egocentric videos while updating a scene graph, enabling richer semantic 3D unde…☆39Updated 2 weeks ago
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning☆37Updated 5 months ago
- Official implementation of GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs.☆46Updated 4 months ago
- [CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation☆59Updated last month
- ☆14Updated last week
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning☆77Updated 6 months ago
- ☆40Updated last week
- Official implementation of SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts☆18Updated 4 months ago
- [CVPR 2025] PanoGS: Gaussian-based Panoptic Segmentation for 3D Open Vocabulary Scene Understanding☆25Updated last month
- Unifying 2D and 3D Vision-Language Understanding☆79Updated 3 weeks ago
- PyTorch implementation of paper: GaussNav: Gaussian Splatting for Visual Navigation☆116Updated 5 months ago
- [RA-L 2025] Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation☆71Updated 3 weeks ago