microsoft / GPT4Vision-Robot-Manipulation-Prompts
This repository provides sample code that interprets human demonstration videos and converts them into high-level tasks for robots.
☆36Updated 4 months ago
Alternatives and similar repositories for GPT4Vision-Robot-Manipulation-Prompts:
Users who are interested in GPT4Vision-Robot-Manipulation-Prompts are comparing it to the repositories listed below:
- ☆42Updated this week
- IROS 2023 "VL-Grasp: a 6-Dof Interactive Grasp Policy for Language-Oriented Objects in Cluttered Indoor Scenes"☆38Updated 11 months ago
- [IROS24 Oral] ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models☆86Updated 7 months ago
- Sim-Grasp offers a simulation framework to generate synthetic data and train models for robotic two finger grasping in cluttered environm…☆24Updated 10 months ago
- VoxAct-B: Voxel-Based Acting and Stabilizing Policy for Bimanual Manipulation (CoRL 2024)☆39Updated 5 months ago
- Official implementation of DemoGen: Synthetic Demonstration Generation for Data-Efficient Visuomotor Policy Learning☆57Updated last month
- Official implementation for paper "Adaptive Compliance Policy: Learning Approximate Compliance for Diffusion Guided Control".☆54Updated 5 months ago
- UniT: Data Efficient Tactile Representation with Generalization to Unseen Objects☆41Updated last month
- Official Code Repo for GENIMA☆70Updated 5 months ago
- ☆81Updated 4 months ago
- ☆39Updated 4 months ago
- Code implementation of GraspGPT and FoundationGrasp☆110Updated 2 weeks ago
- Official implementation of RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic Manipulation☆78Updated 3 months ago
- Official implementation for paper "EquiBot: SIM(3)-Equivariant Diffusion Policy for Generalizable and Data Efficient Learning".☆137Updated 8 months ago
- This is the repo of "Learning to Manipulate Anywhere: A Visual Generalizable Framework For Reinforcement Learning"☆47Updated 3 months ago
- Official code for CVPR'23 paper: Learning Human-to-Robot Handovers from Point Clouds☆90Updated 3 weeks ago
- Language/Clicking grounded SAM + VOS for real-time video object tracking☆18Updated 2 months ago
- Code for the paper: "Active Vision Might Be All You Need: Exploring Active Vision in Bimanual Robotic Manipulation"☆31Updated last week
- RLAfford: End-to-End Affordance Learning for Robotic Manipulation, ICRA 2023☆108Updated 11 months ago
- Official Hardware Codebase for the Paper "BEHAVIOR Robot Suite: Streamlining Real-World Whole-Body Manipulation for Everyday Household Ac…☆76Updated last week
- Code for paper "Diff-Control: A stateful Diffusion-based Policy for Imitation Learning" (Liu et al., IROS 2024)☆51Updated 5 months ago
- Official implementation of Points2Plans: From Point Clouds to Long-Horizon Plans with Composable Relational Dynamics☆34Updated 2 weeks ago
- Waypoint-Based Imitation Learning for Robotic Manipulation☆109Updated last year
- Code for Point Policy: Unifying Observations and Actions with Key Points for Robot Manipulation☆51Updated 3 weeks ago
- [CoRL 2023] REFLECT: Summarizing Robot Experiences for Failure Explanation and Correction☆89Updated last year
- Human Demo Videos to Robot Action Plans☆46Updated 4 months ago
- Manipulate-Anything: Automating Real-World Robots using Vision-Language Models [CoRL 2024]☆24Updated 3 months ago
- ☆49Updated last month
- [CoRL2024] ThinkGrasp: A Vision-Language System for Strategic Part Grasping in Clutter. https://arxiv.org/abs/2407.11298☆67Updated last week
- Public release for "Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections"☆44Updated 9 months ago