yunzeliu / MAP
☆16 Updated last week
Alternatives and similar repositories for MAP:
Users who are interested in MAP are comparing it to the libraries listed below:
- The official implementation of the paper "Exploring the Potential of Encoder-free Architectures in 3D LMMs" ☆50 Updated last month
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning ☆37 Updated 3 months ago
- [NeurIPS 2024 Spotlight] PCP-MAE: Learning to Predict Centers for Point Masked Autoencoders ☆34 Updated 3 weeks ago
- Open-world 3D part segmentation of point clouds ☆71 Updated last week
- [NeurIPS 2024] Official code repository for MSR3D paper ☆44 Updated 3 weeks ago
- [CVPR 2025] The code for the paper "Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding". ☆71 Updated 3 weeks ago
- [3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model ☆65 Updated 2 months ago
- [ECCV 2024] M3DBench introduces a comprehensive 3D instruction-following dataset with support for interleaved multi-modal prompts. ☆60 Updated 5 months ago
- [CVPR 2025] GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding ☆12 Updated this week
- [NeurIPS 2023] Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding ☆67 Updated 4 months ago
- Official Code for 'TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction' ☆56 Updated 3 months ago
- [ICLR 2025] Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction ☆28 Updated last month
- A point cloud visualization repo ☆74 Updated 2 weeks ago
- [ECCV 2024] EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion. ☆89 Updated 9 months ago
- Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024 ☆29 Updated 8 months ago
- [ICCV 2023] Multi3DRefer: Grounding Text Description to Multiple 3D Objects ☆82 Updated last year
- OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding ☆18 Updated last week
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding ☆51 Updated 7 months ago
- [ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image" ☆22 Updated 10 months ago
- Seeing World Dynamics in a Nutshell ☆98 Updated last week
- Official implementation of GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs. ☆40 Updated 3 months ago
- 🔥OSN in PyTorch (ICML 2024) ☆23 Updated 7 months ago
- [ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities ☆67 Updated 5 months ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation ☆135 Updated 2 weeks ago
- Official Code for 'AR-1-to-3: Single Image to Consistent 3D Object via Next-View Prediction' ☆21 Updated last week
- A collection of vision foundation models unifying understanding and generation. ☆47 Updated 2 months ago
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, … ☆81 Updated this week
- [ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow ☆21 Updated last month
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries" ☆73 Updated 7 months ago
- [NeurIPS 2024] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and… ☆55 Updated 2 months ago