[NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding
☆101Feb 2, 2025Updated last year
Alternatives and similar repositories for Lexicon3D
Users that are interested in Lexicon3D are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning☆44Dec 9, 2024Updated last year
- [ICLR 2025] Duoduo CLIP: Efficient 3D Understanding with Multi-View Images☆81May 29, 2025Updated 11 months ago
- Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024☆32Jul 18, 2024Updated last year
- ☆23Apr 4, 2026Updated last month
- [ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities☆84Oct 10, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code for "Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes"☆57Mar 28, 2024Updated 2 years ago
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models☆348Dec 1, 2025Updated 5 months ago
- [ICCV 2023] Multi3DRefer: Grounding Text Description to Multiple 3D Objects☆96Mar 26, 2026Updated 2 months ago
- Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding"☆283Mar 19, 2025Updated last year
- [CVPR 2025] 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs☆53Jun 13, 2024Updated last year
- [ECCV2024] Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding☆126Jul 2, 2024Updated last year
- Code for the paper: "ODIN: A Single Model for 2D and 3D Segmentation" (CVPR 2024)☆178Feb 27, 2026Updated 2 months ago
- [NeurIPS 2024] XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation☆37Jan 20, 2025Updated last year
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries"☆85Aug 2, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [ICCV 2023] Code for "Multi-task View Synthesis with Neural Radiance Fields"☆11Oct 2, 2023Updated 2 years ago
- [NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and…☆60Mar 31, 2025Updated last year
- Unifying 2D and 3D Vision-Language Understanding☆125Apr 8, 2026Updated last month
- [NeurIPS 2024] A Unified Framework for 3D Scene Understanding☆175Jul 7, 2025Updated 10 months ago
- Code for 3D-LLM: Injecting the 3D World into Large Language Models☆1,198Jun 6, 2024Updated last year
- [ECCV'24] OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation☆205Oct 19, 2024Updated last year
- [ECCV 2022, Oral] OPD: Single-view 3D Openable Part Detection☆35Apr 2, 2026Updated last month
- [NeurIPS 2024] MSR3D: Advanced Situated Reasoning in 3D Scenes☆72Dec 2, 2025Updated 5 months ago
- [3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model☆125May 30, 2025Updated 11 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- SAMPro3D: Locating SAM Prompts in 3D for Zero-Shot Instance Segmentation (3DV 2025)☆165Apr 17, 2025Updated last year
- [ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World☆378Oct 21, 2025Updated 7 months ago
- [CVPR 2026 Highlight] SpatialScore: Towards Comprehensive Evaluation for Spatial Intelligence☆72Apr 17, 2026Updated last month
- [ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning☆324Dec 21, 2025Updated 5 months ago
- Open-Vocabulary SAM3D: Understand Any 3D Scene☆41Jun 9, 2025Updated 11 months ago
- ☆23Apr 19, 2024Updated 2 years ago
- [ECCV2024] [3DV Nectar 2025] FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally☆256Sep 13, 2024Updated last year
- Official implementation of ICCV 2023 paper "3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment"☆215Sep 7, 2023Updated 2 years ago
- [CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies☆820Oct 27, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆55Oct 3, 2024Updated last year
- Official implementation of PartSTAD: 2D-to-3D Part Segmentation Task Adaptation (ECCV 2024).☆56Nov 7, 2024Updated last year
- [ECCV 2024] The official PyTorch implementation of the "Part2Object: Hierarchical Unsupervised 3D Instance Segmentation".☆25Sep 12, 2024Updated last year
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception☆15Jul 4, 2025Updated 10 months ago
- Breaking the SSL-AL Barrier: A Synergistic Semi-Supervised Active Learning Framework for 3D Object Detection☆13Mar 23, 2025Updated last year
- ☆37Jan 8, 2026Updated 4 months ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆175Jun 19, 2025Updated 11 months ago