[CVPR 2023] Code for "3D Concept Learning and Reasoning from Multi-View Images"
☆84Jan 20, 2024Updated 2 years ago
Alternatives and similar repositories for 3D-CLR-Official
Users that are interested in 3D-CLR-Official are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆152Aug 23, 2023Updated 2 years ago
- Code for 3D-LLM: Injecting the 3D World into Large Language Models☆1,189Jun 6, 2024Updated last year
- Python package for importing and loading external assets into AI2THOR☆32Jan 5, 2026Updated 2 months ago
- [ECCV 2024] M3DBench introduces a comprehensive 3D instruction-following dataset with support for interleaved multi-modal prompts.☆61Oct 1, 2024Updated last year
- Code Release of "3D Concept Grounding on Neural Fields (NeurIPS2022)"☆15Feb 13, 2023Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- [ICLR 2023] SQA3D for embodied scene understanding and reasoning☆158Oct 13, 2023Updated 2 years ago
- ☆24Oct 22, 2023Updated 2 years ago
- ☆16Apr 10, 2025Updated 11 months ago
- Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding"☆278Mar 19, 2025Updated last year
- SpawnNet: Learning Generalizable Visuomotor Skills from Pre-trained Networks☆36Apr 29, 2024Updated last year
- Code for Ditto in the House: Building Articulation Models of Indoor Scenes through Interactive Perception☆17Aug 25, 2023Updated 2 years ago
- Code release for ConceptFusion [RSS 2023]☆231Sep 23, 2023Updated 2 years ago
- Official implementation of ICCV 2023 paper "3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment"☆216Sep 7, 2023Updated 2 years ago
- [CVPR 2025] 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs☆53Jun 13, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆27Jan 3, 2024Updated 2 years ago
- ☆96Sep 4, 2024Updated last year
- Language-based navigation project☆22Feb 9, 2024Updated 2 years ago
- [CVPR 2025] Program synthesis for 3D spatial reasoning☆59Jun 16, 2025Updated 9 months ago
- [IJCV] EgoPlan-Bench: Benchmarking Multimodal Large Language Models for Human-Level Planning☆82Dec 6, 2024Updated last year
- [ICCV 2023] ARNOLD: Language-Grounded Robot Manipulation with Continuous Object States in Realistic 3D Scenes☆182Mar 16, 2025Updated last year
- TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics☆21Nov 18, 2025Updated 4 months ago
- ☆12Feb 27, 2020Updated 6 years ago
- Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World☆133Oct 24, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆46Apr 2, 2025Updated 11 months ago
- [NeurIPS 2024] MSR3D: Advanced Situated Reasoning in 3D Scenes☆71Dec 2, 2025Updated 3 months ago
- A Vision-Language Model for Spatial Affordance Prediction in Robotics☆215Jul 17, 2025Updated 8 months ago
- [CVPR 2024 & NeurIPS 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI☆658Jun 13, 2025Updated 9 months ago
- Code for "Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers" (NeurIPS 2024)☆206Oct 20, 2025Updated 5 months ago
- ☆10Oct 18, 2024Updated last year
- [CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Langu…☆314Jul 17, 2024Updated last year
- A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Question Answering and 3D Dense Caption) papers and datasets.☆101Feb 26, 2023Updated 3 years ago
- CLEVR3D Dataset: Comprehensive Visual Question Answering on Point Clouds through Compositional Scene Manipulation☆20Feb 2, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [CoRL 2023 Oral] GNFactor: Multi-Task Real Robot Learning with Generalizable Neural Feature Fields☆140Dec 28, 2023Updated 2 years ago
- ☆593Jan 21, 2026Updated 2 months ago
- Code for the ECCV22 paper "Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds"☆95Jun 9, 2023Updated 2 years ago
- [ICML 2020] Visual Grounding of Learned Physical Models☆40Dec 31, 2020Updated 5 years ago
- [TMLR 2025] The official repository of the paper "Unsupervised Discovery of Object-Centric Neural Fields"☆18Feb 15, 2026Updated last month
- StructDiffusion: Language-Guided Creation of Physically-Valid Structures using Unseen Objects☆59Jul 10, 2023Updated 2 years ago
- Code for the paper "Multi-Task Learning of Object States and State-Modifying Actions from Web Videos" published in TPAMI☆11Mar 3, 2024Updated 2 years ago