goxq / MIFAG-code
Code for the paper "Learning 2D Invariant Affordance Knowledge for 3D Affordance Grounding"
☆15Updated 2 months ago
Related projects
Alternatives and complementary repositories for MIFAG-code
- Code for "Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers" (NeurIPS 2024)☆103Updated this week
- [ECCV 2024] ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation☆174Updated this week
- [Arxiv 2024] Official code for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation☆16Updated 4 months ago
- 😎 up-to-date & curated list of awesome 3D Visual Grounding papers, methods & resources.☆74Updated this week
- This repository is used for advertising PhD recruitment opportunities. Contributions are welcome!☆159Updated last month
- [ECCV 2024] ShapeLLM: Universal 3D Object Understanding for Embodied Interaction☆140Updated last month
- [NeurIPS 2024] SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation☆53Updated 2 weeks ago
- The official codebase for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation (CVPR 2024)☆84Updated 4 months ago
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models"☆159Updated 3 weeks ago
- ManiCM: Real-time 3D Diffusion Policy via Consistency Model for Robotic Manipulation☆80Updated 3 months ago
- SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆76Updated 3 weeks ago
- [ICCV 2023] Multi3DRefer: Grounding Text Description to Multiple 3D Objects☆75Updated 9 months ago
- Official implementation of Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation (CVPR'24 H…☆37Updated 3 months ago
- A paper list for Robotics / Embodied AI - Tianxing Chen☆25Updated last week
- Public release for "Explore until Confident: Efficient Exploration for Embodied Question Answering"☆34Updated 4 months ago
- Code & Data for Grounded 3D-LLM with Referent Tokens☆89Updated last month
- [NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and…☆46Updated last week
- The repo for the paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation`☆57Updated 5 months ago
- [ICLR 2023] SQA3D for embodied scene understanding and reasoning☆117Updated last year
- [ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities☆52Updated last month
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning☆45Updated 3 weeks ago
- Code of 3DMIT: 3D MULTI-MODAL INSTRUCTION TUNING FOR SCENE UNDERSTANDING☆24Updated 3 months ago
- Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World☆122Updated 2 weeks ago
- Official codebase for "Any-point Trajectory Modeling for Policy Learning"☆173Updated 2 months ago
- [AAAI 2024] Official implementation of NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models☆149Updated last year