microsoft / GPT4Vision-Robot-Manipulation-Prompts

This repository provides sample code that interprets human demonstration videos and converts them into high-level tasks for robots.

☆42

Alternatives and similar repositories for GPT4Vision-Robot-Manipulation-Prompts

Users who are interested in GPT4Vision-Robot-Manipulation-Prompts are comparing it to the repositories listed below.

PPjmchen / vlmpc
☆55 Updated 4 months ago
ZhihaoAIRobotic / awesome-robot-visual-imitation-learning
A collection of papers, codes and talks of visual imitation learning/imitation learning from video for robotics.
☆77 Updated 2 years ago
omron-sinicx / ViLaIn
An official implementation of Vision-Language Interpreter (ViLaIn)
☆39 Updated last year
mihdalal / manipgen
[ICRA 2025] PyTorch Code for Local Policies Enable Zero-shot Long-Horizon Manipulation
☆115 Updated 3 months ago
sjtuyinjie / Awesome-Grasp-List
A curated list of awesome open-source grasping libraries and resources
☆59 Updated 3 weeks ago
Robot-MA / manipulate-anything
Manipulate-Anything: Automating Real-World Robots using Vision-Language Models [CoRL 2024]
☆42 Updated 4 months ago
yay-robot / yay_robot
PyTorch implementation of YAY Robot
☆150 Updated last year
SiyuanHuang95 / ManipVQA
[IROS24 Oral] ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
☆98 Updated 11 months ago
behavior-robot-suite / brs-ctrl
Official Hardware Codebase for the Paper "BEHAVIOR Robot Suite: Streamlining Real-World Whole-Body Manipulation for Everyday Household Ac…
☆101 Updated 4 months ago
michaal94 / MuBlE
☆30 Updated 9 months ago
gemcollector / maniwhere
This is the repo of "Learning to Manipulate Anywhere: A Visual Generalizable Framework For Reinforcement Learning"
☆69 Updated 7 months ago
Learning-and-Intelligent-Systems / kitchen-worlds
A library of long-horizon Task-and-Motion-Planning (TAMP) problems in kitchen and household scenes, as well as planners to solve them
☆140 Updated 2 months ago
siddhanthaldar / Point-Policy
Code for Point Policy: Unifying Observations and Actions with Key Points for Robot Manipulation
☆73 Updated 2 weeks ago
Soltanilara / av-aloha
Code for the paper: "Active Vision Might Be All You Need: Exploring Active Vision in Bimanual Robotic Manipulation"
☆42 Updated 4 months ago
ACETeleop / ACETeleop
ACE: A Cross-platform Visual-Exoskeletons for Low-Cost Dexterous Teleoperation
☆110 Updated 10 months ago
hyperplane-lab / RLAfford
RLAfford: End-to-End Affordance Learning for Robotic Manipulation, ICRA 2023
☆113 Updated last year
ZhengtongXu / UniT
(RA-L 2025) UniT: Data Efficient Tactile Representation with Generalization to Unseen Objects
☆50 Updated 3 months ago
MohitShridhar / genima
Official Code Repo for GENIMA
☆74 Updated 10 months ago
Stanford-ILIAD / droc
Public release for "Distillation and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections"
☆45 Updated last year
JunzheJosephZhu / see_hear_feel
☆39 Updated 2 months ago
AssassinWS / LLM-TAMP
LLM3: Large Language Model-based Task and Motion Planning with Motion Failure Reasoning
☆90 Updated last year
ir-lab / Diff-Control
Code for paper "Diff-Control: A stateful Diffusion-based Policy for Imitation Learning" (Liu et al., IROS 2024)
☆65 Updated 2 months ago
mihdalal / planseqlearn
[ICLR 2024] PyTorch Code for Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks
☆112 Updated 11 months ago
chrisyrniu / Human2LocoMan
☆34 Updated this week
rail-berkeley / fmb
☆62 Updated last year
mkt1412 / GraspGPT_public
Code implementation of GraspGPT and FoundationGrasp
☆122 Updated last month
agilexrobotics / mobile_aloha_sim
☆112 Updated 3 weeks ago
rail-berkeley / serl_franka_controllers
Cartesian impedance controller with reference limiting for Franka Emika Robot
☆148 Updated last year
HaoxuHuang / copa
Official implementation of CoPa: General Robotic Manipulation through Spatial Constraints of Parts with Foundation Models
☆89 Updated 6 months ago
fuse-model / FuSe
☆56 Updated 6 months ago