Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"
☆239Apr 2, 2026Updated 2 weeks ago
Alternatives and similar repositories for RT-X
Users that are interested in RT-X are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆1,756Nov 5, 2025Updated 5 months ago
- Implementation of RT1 (Robotic Transformer) in Pytorch☆447Oct 6, 2024Updated last year
- Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.☆1,611Jul 31, 2024Updated last year
- ☆1,701Jan 31, 2024Updated 2 years ago
- DROID Policy Learning and Evaluation☆276Apr 22, 2025Updated 11 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Democratization of RT-2 "RT-2: New model translates vision and language into action"☆557Jul 26, 2024Updated last year
- Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…☆1,032Dec 20, 2025Updated 3 months ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆45Apr 19, 2024Updated last year
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆307Apr 22, 2024Updated last year
- CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks☆880Sep 8, 2025Updated 7 months ago
- A vast array of Multi-Modal Embodied Robotic Foundation Models!☆28Mar 18, 2024Updated 2 years ago
- ☆282Aug 26, 2024Updated last year
- [CVPR 2024] Hierarchical Diffusion Policy for Multi-Task Robotic Manipulation☆231Apr 9, 2024Updated 2 years ago
- Official codebase for "Any-point Trajectory Modeling for Policy Learning"☆276Jun 19, 2025Updated 9 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Multi-Modal Multi-Embodied Hivemind-like Iteration of RTX-2☆15Jun 27, 2025Updated 9 months ago
- Official repository of Learning to Act from Actionless Videos through Dense Correspondences.☆250Apr 25, 2024Updated last year
- Code for RoboFlamingo☆428May 8, 2024Updated last year
- Data pre-processing and training code on Open-X-Embodiment with pytorch☆11Jan 20, 2025Updated last year
- Reimplementation of GR-1, a generalized policy for robotics manipulation.☆151Sep 4, 2024Updated last year
- VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models☆796Feb 20, 2025Updated last year
- Official Code for RVT-2 and RVT☆401Feb 14, 2025Updated last year
- Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"☆388Aug 17, 2024Updated last year
- [RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre…☆169Oct 16, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- robomimic: A Modular Framework for Robot Learning from Demonstration☆1,366Feb 5, 2026Updated 2 months ago
- [RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion☆4,006Dec 24, 2024Updated last year
- Suite of human-collected datasets and a multi-task continuous control benchmark for open vocabulary visuolinguomotor learning.☆356Apr 9, 2026Updated last week
- OpenVLA: An open-source vision-language-action model for robotic manipulation.☆5,874Mar 23, 2025Updated last year
- Repo for Bring Your Own Vision-Language-Action (VLA) model, arxiv 2024☆37Jan 22, 2025Updated last year
- [CVPR 2025] The offical Implementation of "Universal Actions for Enhanced Embodied Foundation Models"☆234Nov 6, 2025Updated 5 months ago
- [ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model☆619Oct 29, 2024Updated last year
- A unified architecture for multimodal multi-task robotic policy learning.☆179Feb 2, 2024Updated 2 years ago
- Implementation of "PaLM-E: An Embodied Multimodal Language Model"☆334Jan 29, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Implementation of Deepmind's RoboCat: "Self-Improving Foundation Agent for Robotic Manipulation" An next generation robot LLM☆89Sep 4, 2023Updated 2 years ago
- Masked Visual Pre-training for Robotics☆246Apr 1, 2023Updated 3 years ago
- [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model`☆121Oct 7, 2024Updated last year
- A PyTorch re-implementation of the RT-1 (Robotics Transformer)☆50Oct 18, 2023Updated 2 years ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆83Dec 12, 2024Updated last year
- RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots☆1,323Updated this week
- A generative and self-guided robotic agent that endlessly propose and master new skills.☆1,163May 31, 2024Updated last year