SiyuanHuang95 / ManipVQA
[IROS24 Oral] ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
☆84 Updated 5 months ago
Alternatives and similar repositories for ManipVQA:
Users who are interested in ManipVQA are comparing it to the libraries listed below.
- [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model` ☆104 Updated 4 months ago
- Official implementation of RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic Manipulation ☆75 Updated last month
- The official codebase for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation (CVPR 2024) ☆110 Updated 7 months ago
- MOKA: Open-World Robotic Manipulation through Mark-based Visual Prompting (RSS 2024) ☆70 Updated 7 months ago
- Human Demo Videos to Robot Action Plans ☆43 Updated 3 months ago
- A simple testbed for robotics manipulation policies ☆75 Updated this week
- Official implementation of GR-MG ☆70 Updated last month
- ☆64 Updated this week
- Official implementation of CoPa: General Robotic Manipulation through Spatial Constraints of Parts with Foundation Models ☆57 Updated 3 weeks ago
- ☆60 Updated this week
- [NeurIPS 2024] CLOVER: Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation ☆95 Updated 2 months ago
- Official Code Repo for GENIMA ☆65 Updated 4 months ago
- Code implementation of GraspGPT and FoundationGrasp