microsoft / GPT4Vision-Robot-Manipulation-Prompts
This repository provides the sample code designed to interpret human demonstration videos and convert them into high-level tasks for robots.
☆38Updated 5 months ago
Alternatives and similar repositories for GPT4Vision-Robot-Manipulation-Prompts:
Users that are interested in GPT4Vision-Robot-Manipulation-Prompts are comparing it to the libraries listed below
- Sim-Grasp offers a simulation framework to generate synthetic data and train models for robotic two finger grasping in cluttered environm…☆29Updated 11 months ago
- Public release for "Distillation and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections"☆44Updated 10 months ago
- code implementation of GraspGPT and FoundationGrasp☆111Updated last month
- Official implementation of GROOT, CoRL 2023☆57Updated last year
- [CoRL2024] ThinkGrasp: A Vision-Language System for Strategic Part Grasping in Clutter. https://arxiv.org/abs/2407.11298☆67Updated last month
- Human Demo Videos to Robot Action Plans☆48Updated 5 months ago
- [IROS24 Oral]ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models☆89Updated 8 months ago
- This is the repo of "Learning to Manipulate Anywhere: A Visual Generalizable Framework For Reinforcement Learning"☆52Updated 4 months ago
- (RA-L 2025) UniT: Data Efficient Tactile Representation with Generalization to Unseen Objects☆45Updated last week
- Official implementation of CoPa: General Robotic Manipulation through Spatial Constraints of Parts with Foundation Models☆73Updated 3 months ago
- Official Code Repo for GENIMA☆70Updated 6 months ago
- Official code for CVPR'23 paper: Learning Human-to-Robot Handovers from Point Clouds☆96Updated last month
- This is the official implementation of RoboBERT, which is a novel end-to-end mutiple-modality robotic operations training framework.☆48Updated last month
- RLAfford: End-to-End Affordance Learning for Robotic Manipulation, ICRA 2023☆110Updated last year
- Language/Clicking grounded SAM + VOS for real-time video object tracking☆19Updated 3 months ago
- IROS 2023 "VL-Grasp: a 6-Dof Interactive Grasp Policy for Language-Oriented Objects in Cluttered Indoor Scenes"☆40Updated last year
- ☆38Updated last year
- VoxAct-B: Voxel-Based Acting and Stabilizing Policy for Bimanual Manipulation (CoRL 2024)☆42Updated 6 months ago
- Code for the paper: "Active Vision Might Be All You Need: Exploring Active Vision in Bimanual Robotic Manipulation"☆33Updated last month
- This repo contains the official implementation of CoRL2023 paper "Language-guided Robot Grasping: CLIP-based Referring Grasp Synthesis in…☆13Updated last year
- Code for paper "Diff-Control: A stateful Diffusion-based Policy for Imitation Learning" (Liu et al., IROS 2024)☆53Updated 5 months ago
- Manipulate-Anything: Automating Real-World Robots using Vision-Language Models [CoRL 2024]☆28Updated 3 weeks ago
- Official implementation for paper "EquiBot: SIM(3)-Equivariant Diffusion Policy for Generalizable and Data Efficient Learning".☆143Updated 9 months ago
- PyTorch implementation of YAY Robot☆139Updated last year
- [CoRL 2023] REFLECT: Summarizing Robot Experiences for Failure Explanation and Correction☆94Updated last year
- Waypoint-Based Imitation Learning for Robotic Manipulation☆112Updated last year
- ☆107Updated 2 years ago
- A simple testbed for robotics manipulation policies☆82Updated 2 weeks ago
- [ICLR 2024] PyTorch Code for Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks☆98Updated 8 months ago
- Action Chunking Transformers with In-the-Wild Learning Framework☆19Updated last year