CognitiveAISystems / 3DGraphLLM
3DGraphLLM is a model that uses a 3D scene graph and an LLM to perform 3D vision-language tasks.
☆37Updated last month
Alternatives and similar repositories for 3DGraphLLM:
Users that are interested in 3DGraphLLM are comparing it to the libraries listed below
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding☆82Updated 2 months ago
- 4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight)☆98Updated 9 months ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆71Updated 2 weeks ago
- ☆37Updated last year
- ☆36Updated last year
- [ICLR 2025] Official code of "Segment any 3D Object with Language"☆41Updated 3 weeks ago
- [3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model☆58Updated 3 weeks ago
- [NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of …☆67Updated last year
- Open3DIS: Open-vocabulary 3D Instance Segmentation with 2D Mask Guidance (CVPR 2024)☆87Updated 3 months ago
- Code of 3DMIT: 3D MULTI-MODAL INSTRUCTION TUNING FOR SCENE UNDERSTANDING☆28Updated 6 months ago
- SceneFun3D ToolKit☆89Updated 4 months ago
- Improving 3D Large Language Model via Robust Instruction Tuning☆51Updated 4 months ago
- [CVPR 2024, Highlight] Living Scenes: Multi-object Relocalization and Reconstruction in Changing 3D Environments☆87Updated 7 months ago
- ☆34Updated 10 months ago
- Pixel-Aligned Recurrent Queries for Multi-View 3D Object Detection (ICCV23)☆40Updated last year
- ☆48Updated 4 months ago
- Official Implementation of 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs☆36Updated 8 months ago
- [RAL 2024] OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding☆24Updated this week
- [NeurIPS2024] Multiview Scene Graph (topologically representing a scene from unposed images by interconnected place and object nodes)☆102Updated last month
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning☆35Updated 2 months ago
- [ICLR 2024] AGILE3D: Attention Guided Interactive Multi-object 3D Segmentation☆105Updated 4 months ago
- Official implementation of GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs.☆36Updated 2 months ago
- ☆59Updated last week
- [NeurIPS 2023 Spotlight] Code for "Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast Contrastive Fusion"☆66Updated last year
- ☆18Updated 3 weeks ago
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding☆47Updated 6 months ago
- IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos☆35Updated last month
- [CoRL2023] Open-Vocabulary Scene-Graph☆63Updated last year
- This is the official repository for OVIR-3D: Open-Vocabulary 3D Instance Retrieval Without Training on 3D Data. (CoRL'23)☆101Updated last year