goxq / MIFAG-code
Codes of Paper "Learning 2D Invariant Affordance Knowledge for 3D Affordance Grounding"
☆15Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for MIFAG-code
- ☆17Updated 3 months ago
- [ECCV 2024] ShapeLLM: Universal 3D Object Understanding for Embodied Interaction☆144Updated last month
- Code for "Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers" (NeurIPS 2024)☆108Updated 2 weeks ago
- 😎 up-to-date & curated list of awesome 3D Visual Grounding papers, methods & resources.☆83Updated last week
- [ECCV 2024] ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation☆176Updated 2 weeks ago
- [Arxiv 2024] Official code for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation☆16Updated 4 months ago
- The official codebase for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation(cvpr 2024)☆85Updated 4 months ago
- A paper list for Robotics / Embodied AI - Tianxing Chen☆26Updated 2 weeks ago
- ManiCM: Real-time 3D Diffusion Policy via Consistency Model for Robotic Manipulation☆83Updated 4 months ago
- ☆104Updated last year
- This repository is used for advertising PhD recruitment opportunities. Contributions are welcome!☆159Updated 2 months ago
- ☆45Updated last month
- A Visualization Tool for GPU Occupancy on S Cluster.☆13Updated 2 years ago
- [CVPR 2023] Vote2Cap-DETR and [T-PAMI 2024] Vote2Cap-DETR++; A set-to-set perspective towards 3D Dense Captioning; State-of-the-Art 3D De…☆85Updated 3 months ago
- ☆69Updated 3 weeks ago
- [AAAI 2024] Official implementation of NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models☆152Updated last year
- [NeurIPS 2024] SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation☆55Updated 3 weeks ago
- Public release for "Explore until Confident: Efficient Exploration for Embodied Question Answering"☆35Updated 4 months ago
- [CVPR 2024 & NeurIPS 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI☆490Updated this week
- [IROS24 Oral]ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models☆72Updated 2 months ago
- [ICCV 2023] Multi3DRefer: Grounding Text Description to Multiple 3D Objects☆75Updated 9 months ago
- [ICLR 2023] SQA3D for embodied scene understanding and reasoning☆117Updated last year
- [ICML 2024] Official code repository for 3D embodied generalist agent LEO☆365Updated last month
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.☆163Updated last month
- [ICCV 2023] PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning☆228Updated last year
- Official implementation of Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation (CVPR'24 H…☆38Updated 4 months ago
- Code&Data for Grounded 3D-LLM with Referent Tokens☆89Updated last month
- A curated list of awesome papers on Embodied AI and related research/industry-driven resources.☆289Updated 3 months ago
- [CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Langu…☆249Updated 4 months ago