kyegomez/RT-X

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kyegomez/RT-X)

kyegomez / RT-X

Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"

☆237

Alternatives and similar repositories for RT-X

Users that are interested in RT-X are comparing it to the libraries listed below

Sorting:

google-deepmind / open_x_embodiment
View on GitHub
☆1,682Nov 5, 2025Updated 4 months ago
octo-models / octo
View on GitHub
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.
☆1,552Jul 31, 2024Updated last year
google-research / robotics_transformer
View on GitHub
☆1,680Jan 31, 2024Updated 2 years ago
droid-dataset / droid_policy_learning
View on GitHub
DROID Policy Learning and Evaluation
☆270Apr 22, 2025Updated 10 months ago
simpler-env / SimplerEnv
View on GitHub
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…
☆991Dec 20, 2025Updated 2 months ago
kyegomez / RT-2
View on GitHub
Democratization of RT-2 "RT-2: New model translates vision and language into action"
☆551Jul 26, 2024Updated last year
bytedance / GR-1
View on GitHub
Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"
☆301Apr 22, 2024Updated last year
mees / calvin
View on GitHub
CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
☆841Sep 8, 2025Updated 5 months ago
irom-princeton / byovla
View on GitHub
Repo for Bring Your Own Vision-Language-Action (VLA) model, arxiv 2024
☆36Jan 22, 2025Updated last year
GR1-Manipulation / GR-1
View on GitHub
Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"
☆45Apr 19, 2024Updated last year
rail-berkeley / crossformer
View on GitHub
☆278Aug 26, 2024Updated last year
Large-Trajectory-Model / ATM
View on GitHub
Official codebase for "Any-point Trajectory Modeling for Policy Learning"
☆273Jun 19, 2025Updated 8 months ago
RoboFlamingo / RoboFlamingo
View on GitHub
Code for RoboFlamingo
☆425May 8, 2024Updated last year
dyson-ai / hdp
View on GitHub
[CVPR 2024] Hierarchical Diffusion Policy for Multi-Task Robotic Manipulation
☆228Apr 9, 2024Updated last year
flow-diffusion / AVDC
View on GitHub
Official repository of Learning to Act from Actionless Videos through Dense Correspondences.
☆249Apr 25, 2024Updated last year
NVlabs / RVT
View on GitHub
Official Code for RVT-2 and RVT
☆398Feb 14, 2025Updated last year
huangwl18 / VoxPoser
View on GitHub
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models
☆785Feb 20, 2025Updated last year
ARISE-Initiative / robomimic
View on GitHub
robomimic: A Modular Framework for Robot Learning from Demonstration
☆1,309Feb 5, 2026Updated last month
kyegomez / awesome-robotic-foundation-models
View on GitHub
A vast array of Multi-Modal Embodied Robotic Foundation Models!
☆28Mar 18, 2024Updated last year
intuitive-robots / mdt_policy
View on GitHub
[RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre…
☆168Oct 16, 2024Updated last year
EDiRobotics / GR1-Training
View on GitHub
Reimplementation of GR-1, a generalized policy for robotics manipulation.
☆147Sep 4, 2024Updated last year
2toinf / UniAct
View on GitHub
[CVPR 2025] The offical Implementation of "Universal Actions for Enhanced Embodied Foundation Models"
☆232Nov 6, 2025Updated 4 months ago
real-stanford / diffusion_policy
View on GitHub
[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion
☆3,820Dec 24, 2024Updated last year
openvla / openvla
View on GitHub
OpenVLA: An open-source vision-language-action model for robotic manipulation.
☆5,383Mar 23, 2025Updated 11 months ago
nickgkan / 3d_diffuser_actor
View on GitHub
Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"
☆384Aug 17, 2024Updated last year
UMass-Embodied-AGI / 3D-VLA
View on GitHub
[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model
☆623Oct 29, 2024Updated last year
google-research / language-table
View on GitHub
Suite of human-collected datasets and a multi-task continuous control benchmark for open vocabulary visuolinguomotor learning.
☆351Feb 20, 2026Updated 2 weeks ago
changhaonan / A3VLM
View on GitHub
[CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model`
☆121Oct 7, 2024Updated last year
AlbertTan404 / pytorch-open-x-embodiment
View on GitHub
Data pre-processing and training code on Open-X-Embodiment with pytorch
☆11Jan 20, 2025Updated last year
zhouxian / act3d-chained-diffuser
View on GitHub
A unified architecture for multimodal multi-task robotic policy learning.
☆176Feb 2, 2024Updated 2 years ago
robocasa / robocasa
View on GitHub
RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots
☆1,165Updated this week
kyegomez / PALM-E
View on GitHub
Implementation of "PaLM-E: An Embodied Multimodal Language Model"
☆335Jan 29, 2024Updated 2 years ago
Genesis-Embodied-AI / RoboGen
View on GitHub
A generative and self-guided robotic agent that endlessly propose and master new skills.
☆1,150May 31, 2024Updated last year
HeegerGao / FLIP
View on GitHub
Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks
☆79Dec 12, 2024Updated last year
VoxAct-B / voxactb
View on GitHub
VoxAct-B: Voxel-Based Acting and Stabilizing Policy for Bimanual Manipulation (CoRL 2024)
☆52Oct 25, 2024Updated last year
kyegomez / RoboCAT
View on GitHub
Implementation of Deepmind's RoboCat: "Self-Improving Foundation Agent for Robotic Manipulation" An next generation robot LLM
☆87Sep 4, 2023Updated 2 years ago
horipse01 / 3d-foundation-policy
View on GitHub
☆89Sep 23, 2025Updated 5 months ago
Robot-VLAs / RoboVLMs
View on GitHub
☆443Nov 29, 2025Updated 3 months ago
ir413 / mvp
View on GitHub
Masked Visual Pre-training for Robotics
☆245Apr 1, 2023Updated 2 years ago