OpenGVLab / Instruct2ActLinks

Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model

☆370

Alternatives and similar repositories for Instruct2Act

Users that are interested in Instruct2Act are comparing it to the libraries listed below

Sorting:

kyegomez / PALM-E
Implementation of "PaLM-E: An Embodied Multimodal Language Model"
☆330Updated last year
RoboFlamingo / RoboFlamingo
Code for RoboFlamingo
☆409Updated last year
Gary3410 / TaPA
[arXiv 2023] Embodied Task Planning with Large Language Models
☆192Updated 2 years ago
kyegomez / RT-X
Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"
☆229Updated last week
vimalabs / VIMABench
Official Task Suite Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
☆320Updated 2 years ago
kyegomez / RT-2
Democratization of RT-2 "RT-2: New model translates vision and language into action"
☆529Updated last year
liruiw / GenSim
Generating Robotic Simulation Tasks via Large Language Models
☆339Updated last year
MichalZawalski / embodied-CoT
Embodied Chain of Thought: A robotic policy that reason to solve the task.
☆323Updated 7 months ago
huangwl18 / VoxPoser
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models
☆750Updated 9 months ago
UMass-Embodied-AGI / 3D-VLA
[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model
☆593Updated last year
clorislili / ManipLLM
The official codebase for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation(cvpr 2024)
☆143Updated last year
Robot-VLAs / RoboVLMs
☆409Updated 9 months ago
NVlabs / RVT
Official Code for RVT-2 and RVT
☆386Updated 9 months ago
bytedance / GR-1
Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"
☆286Updated last year
H-Freax / Awesome-Video-Robotic-Papers
This repository compiles a list of papers related to the application of video technology in the field of robotics! Star⭐ the repo and fol…
☆168Updated 9 months ago
FlagOpen / RoboBrain
[CVPR 2025] RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete. Official Repository.
☆341Updated last month
embodied-generalist / embodied-generalist
[ICML 2024] Official code repository for 3D embodied generalist agent LEO
☆465Updated 7 months ago
rail-berkeley / bridge_data_v2
☆238Updated last year
real-stanford / scalingup
[CoRL 2023] This repository contains data generation and training code for Scaling Up & Distilling Down
☆402Updated last year
mmrobotlab / DailyRobot
☆83Updated 2 years ago
RayYoh / OCRM_survey
A Survey of Embodied Learning for Object-Centric Robotic Manipulation
☆244Updated last year
microsoft / CogACT
A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation
☆374Updated 3 weeks ago
j96w / MimicPlay
"MimicPlay: Long-Horizon Imitation Learning by Watching Human Play" code repository
☆300Updated last year
embodied-agent-interface / embodied-agent-interface
Embodied Agent Interface (EAI): Benchmarking LLMs for Embodied Decision Making (NeurIPS D&B 2024 Oral)
☆266Updated 8 months ago
vimalabs / VIMA
Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
☆835Updated last year
WayneMao / RoboMatrix
The Official Implementation of RoboMatrix
☆102Updated 6 months ago
NVlabs / progprompt-vh
ProgPrompt for Virtualhome
☆144Updated 2 years ago
SiyuanHuang95 / ManipVQA
[IROS24 Oral]ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
☆97Updated last year
lmzpai / roboMamba
The repo of paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation`
☆139Updated 10 months ago
JeffreyYH / Awesome-Generalist-Robots-via-Foundation-Models
Paper list in the survey paper: Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis
☆448Updated last month