OpenGVLab / Instruct2Act
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
☆348Updated 6 months ago
Alternatives and similar repositories for Instruct2Act:
Users that are interested in Instruct2Act are comparing it to the libraries listed below
- Code for RoboFlamingo☆334Updated 8 months ago
- Official Task Suite Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"☆290Updated last year
- [ICML 2024] Official code repository for 3D embodied generalist agent LEO☆396Updated 3 months ago
- [arXiv 2023] Embodied Task Planning with Large Language Models☆167Updated last year
- ☆235Updated this week
- Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"☆186Updated 2 weeks ago
- Implementation of "PaLM-E: An Embodied Multimodal Language Model"☆280Updated 11 months ago
- [ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model☆403Updated 2 months ago
- Generating Robotic Simulation Tasks via Large Language Models☆305Updated 9 months ago
- Embodied Chain of Thought: A robotic policy that reason to solve the task.☆121Updated 4 months ago
- Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…☆441Updated 2 months ago
- This repository compiles a list of papers related to the application of video technology in the field of robotics! Star⭐ the repo and fol…☆136Updated last week
- Paper list in the survey paper: Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis☆389Updated last month
- Democratization of RT-2 "RT-2: New model translates vision and language into action"☆398Updated 5 months ago
- VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models☆623Updated 8 months ago
- Benchmarking Knowledge Transfer in Lifelong Robot Learning☆292Updated 2 weeks ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆211Updated 8 months ago
- The official codebase for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation(cvpr 2024)☆102Updated 6 months ago
- "MimicPlay: Long-Horizon Imitation Learning by Watching Human Play" code repository☆248Updated 8 months ago
- [CoRL 2023] This repository contains data generation and training code for Scaling Up & Distilling Down☆366Updated 5 months ago
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.☆192Updated 3 weeks ago
- [RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations☆638Updated this week
- ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation☆616Updated 4 months ago
- ☆214Updated this week
- Reimplementation of GR-1, a generalized policy for robotics manipulation.☆113Updated 4 months ago
- Embodied Agent Interface (EAI): Benchmarking LLMs for Embodied Decision Making (NeurIPS D&B 2024 Oral)☆151Updated this week
- Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"☆262Updated 5 months ago
- Official Code for RVT-2 and RVT☆302Updated last month
- A Survey of Embodied Learning for Object-Centric Robotic Manipulation☆168Updated 3 months ago
- Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"☆792Updated 8 months ago