kyegomez/RT-2

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kyegomez/RT-2)

kyegomez / RT-2

Democratization of RT-2 "RT-2: New model translates vision and language into action"

☆551

Alternatives and similar repositories for RT-2

Users that are interested in RT-2 are comparing it to the libraries listed below

Sorting:

google-research / robotics_transformer
View on GitHub
☆1,680Jan 31, 2024Updated 2 years ago
kyegomez / RT-X
View on GitHub
Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"
☆237Feb 20, 2026Updated last week
octo-models / octo
View on GitHub
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.
☆1,552Jul 31, 2024Updated last year
google-deepmind / open_x_embodiment
View on GitHub
☆1,682Nov 5, 2025Updated 3 months ago
huangwl18 / VoxPoser
View on GitHub
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models
☆784Feb 20, 2025Updated last year
kyegomez / PALM-E
View on GitHub
Implementation of "PaLM-E: An Embodied Multimodal Language Model"
☆335Jan 29, 2024Updated 2 years ago
GT-RIPL / Awesome-LLM-Robotics
View on GitHub
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
☆4,283Jan 27, 2026Updated last month
simpler-env / SimplerEnv
View on GitHub
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…
☆980Dec 20, 2025Updated 2 months ago
openvla / openvla
View on GitHub
OpenVLA: An open-source vision-language-action model for robotic manipulation.
☆5,383Mar 23, 2025Updated 11 months ago
kyegomez / RoboCAT
View on GitHub
Implementation of Deepmind's RoboCat: "Self-Improving Foundation Agent for Robotic Manipulation" An next generation robot LLM
☆87Sep 4, 2023Updated 2 years ago
bytedance / GR-1
View on GitHub
Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"
☆300Apr 22, 2024Updated last year
google-research / language-table
View on GitHub
Suite of human-collected datasets and a multi-task continuous control benchmark for open vocabulary visuolinguomotor learning.
☆351Feb 20, 2026Updated last week
UMass-Embodied-AGI / 3D-VLA
View on GitHub
[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model
☆622Oct 29, 2024Updated last year
NVlabs / RVT
View on GitHub
Official Code for RVT-2 and RVT
☆398Feb 14, 2025Updated last year
flow-diffusion / AVDC
View on GitHub
Official repository of Learning to Act from Actionless Videos through Dense Correspondences.
☆248Apr 25, 2024Updated last year
thu-ml / RoboticsDiffusionTransformer
View on GitHub
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
☆1,625Jan 21, 2026Updated last month
mees / calvin
View on GitHub
CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
☆841Sep 8, 2025Updated 5 months ago
facebookresearch / home-robot
View on GitHub
Mobile manipulation research tools for roboticists
☆1,189Jun 8, 2024Updated last year
vimalabs / VIMA
View on GitHub
Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
☆844Apr 18, 2024Updated last year
huangwl18 / ReKep
View on GitHub
ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation
☆911Feb 20, 2025Updated last year
Lifelong-Robot-Learning / LIBERO
View on GitHub
Benchmarking Knowledge Transfer in Lifelong Robot Learning
☆1,517Mar 15, 2025Updated 11 months ago
real-stanford / diffusion_policy
View on GitHub
[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion
☆3,796Dec 24, 2024Updated last year
facebookresearch / r3m
View on GitHub
Pre-training Reusable Representations for Robotic Manipulation Using Diverse Human Video Data
☆366Mar 21, 2023Updated 2 years ago
Genesis-Embodied-AI / RoboGen
View on GitHub
A generative and self-guided robotic agent that endlessly propose and master new skills.
☆1,150May 31, 2024Updated last year
allenzren / open-pi-zero
View on GitHub
Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence
☆1,397Jan 31, 2025Updated last year
RoboFlamingo / RoboFlamingo
View on GitHub
Code for RoboFlamingo
☆424May 8, 2024Updated last year
rail-berkeley / bridge_data_v2
View on GitHub
☆264Mar 17, 2024Updated last year
vimalabs / VIMABench
View on GitHub
Official Task Suite Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
☆325Sep 26, 2023Updated 2 years ago
stepjam / RLBench
View on GitHub
A large-scale benchmark and learning environment.
☆1,702Jan 25, 2025Updated last year
haosulab / ManiSkill
View on GitHub
SAPIEN Manipulation Skill Framework, an open source GPU parallelized robotics simulator and benchmark, led by Hillbot, Inc.
☆2,595Jan 31, 2026Updated last month
droid-dataset / droid_policy_learning
View on GitHub
DROID Policy Learning and Evaluation
☆270Apr 22, 2025Updated 10 months ago
ARISE-Initiative / robosuite
View on GitHub
robosuite: A Modular Simulation Framework and Benchmark for Robot Learning
☆2,230Updated this week
YanjieZe / Improved-3D-Diffusion-Policy
View on GitHub
[IROS 2025] Generalizable Humanoid Manipulation with 3D Diffusion Policies. Part 1: Train & Deploy of iDP3
☆506Jun 16, 2025Updated 8 months ago
peract / peract
View on GitHub
Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation
☆483May 9, 2024Updated last year
OpenGVLab / Instruct2Act
View on GitHub
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
☆373Jun 23, 2024Updated last year
kyegomez / AutoRT
View on GitHub
Implementation of AutoRT: "AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents"
☆42Nov 11, 2024Updated last year
graspnet / anygrasp_sdk
View on GitHub
☆762Nov 23, 2025Updated 3 months ago
nickgkan / 3d_diffuser_actor
View on GitHub
Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"
☆384Aug 17, 2024Updated last year
YanjieZe / 3D-Diffusion-Policy
View on GitHub
[RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations
☆1,262Oct 17, 2025Updated 4 months ago