Democratization of RT-2 "RT-2: New model translates vision and language into action"
☆551Jul 26, 2024Updated last year
Alternatives and similar repositories for RT-2
Users that are interested in RT-2 are comparing it to the libraries listed below
Sorting:
- ☆1,680Jan 31, 2024Updated 2 years ago
- Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"☆237Feb 20, 2026Updated last week
- Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.☆1,552Jul 31, 2024Updated last year
- ☆1,682Nov 5, 2025Updated 3 months ago
- VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models☆784Feb 20, 2025Updated last year
- Implementation of "PaLM-E: An Embodied Multimodal Language Model"☆335Jan 29, 2024Updated 2 years ago
- A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites☆4,283Jan 27, 2026Updated last month
- Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…☆980Dec 20, 2025Updated 2 months ago
- OpenVLA: An open-source vision-language-action model for robotic manipulation.☆5,383Mar 23, 2025Updated 11 months ago
- Implementation of Deepmind's RoboCat: "Self-Improving Foundation Agent for Robotic Manipulation" An next generation robot LLM☆87Sep 4, 2023Updated 2 years ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆300Apr 22, 2024Updated last year
- Suite of human-collected datasets and a multi-task continuous control benchmark for open vocabulary visuolinguomotor learning.☆351Feb 20, 2026Updated last week
- [ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model☆622Oct 29, 2024Updated last year
- Official Code for RVT-2 and RVT☆398Feb 14, 2025Updated last year
- Official repository of Learning to Act from Actionless Videos through Dense Correspondences.☆248Apr 25, 2024Updated last year
- RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation☆1,625Jan 21, 2026Updated last month
- CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks☆841Sep 8, 2025Updated 5 months ago
- Mobile manipulation research tools for roboticists☆1,189Jun 8, 2024Updated last year
- Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"☆844Apr 18, 2024Updated last year
- ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation☆911Feb 20, 2025Updated last year
- Benchmarking Knowledge Transfer in Lifelong Robot Learning☆1,517Mar 15, 2025Updated 11 months ago
- [RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion☆3,796Dec 24, 2024Updated last year
- Pre-training Reusable Representations for Robotic Manipulation Using Diverse Human Video Data☆366Mar 21, 2023Updated 2 years ago
- A generative and self-guided robotic agent that endlessly propose and master new skills.☆1,150May 31, 2024Updated last year
- Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence☆1,397Jan 31, 2025Updated last year
- Code for RoboFlamingo☆424May 8, 2024Updated last year
- ☆264Mar 17, 2024Updated last year
- Official Task Suite Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"☆325Sep 26, 2023Updated 2 years ago
- A large-scale benchmark and learning environment.☆1,702Jan 25, 2025Updated last year
- SAPIEN Manipulation Skill Framework, an open source GPU parallelized robotics simulator and benchmark, led by Hillbot, Inc.☆2,595Jan 31, 2026Updated last month
- DROID Policy Learning and Evaluation☆270Apr 22, 2025Updated 10 months ago
- robosuite: A Modular Simulation Framework and Benchmark for Robot Learning☆2,230Updated this week
- [IROS 2025] Generalizable Humanoid Manipulation with 3D Diffusion Policies. Part 1: Train & Deploy of iDP3☆506Jun 16, 2025Updated 8 months ago
- Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation☆483May 9, 2024Updated last year
- Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model☆373Jun 23, 2024Updated last year
- Implementation of AutoRT: "AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents"☆42Nov 11, 2024Updated last year
- ☆762Nov 23, 2025Updated 3 months ago
- Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"☆384Aug 17, 2024Updated last year
- [RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations☆1,262Oct 17, 2025Updated 4 months ago