Robot-VLAs/RoboVLMs

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Robot-VLAs/RoboVLMs)

Robot-VLAs / RoboVLMs

☆438

Alternatives and similar repositories for RoboVLMs

Users that are interested in RoboVLMs are comparing it to the libraries listed below

Sorting:

simpler-env / SimplerEnv
View on GitHub
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…
☆980Dec 20, 2025Updated 2 months ago
bytedance / GR-1
View on GitHub
Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"
☆300Apr 22, 2024Updated last year
microsoft / CogACT
View on GitHub
A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation
☆406Oct 30, 2025Updated 4 months ago
allenzren / open-pi-zero
View on GitHub
Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence
☆1,397Jan 31, 2025Updated last year
InternRobotics / Seer
View on GitHub
[ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation
☆280Jul 8, 2025Updated 7 months ago
thu-ml / RoboticsDiffusionTransformer
View on GitHub
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
☆1,625Jan 21, 2026Updated last month
moojink / openvla-oft
View on GitHub
Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success
☆1,051Sep 9, 2025Updated 5 months ago
bytedance / GR-MG
View on GitHub
Official implementation of GR-MG
☆93Jan 12, 2025Updated last year
mees / calvin
View on GitHub
CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
☆837Sep 8, 2025Updated 5 months ago
MichalZawalski / embodied-CoT
View on GitHub
Embodied Chain of Thought: A robotic policy that reason to solve the task.
☆369Apr 5, 2025Updated 10 months ago
SpatialVLA / SpatialVLA
View on GitHub
🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.
☆657Jun 23, 2025Updated 8 months ago
OpenDriveLab / UniVLA
View on GitHub
[RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions
☆990Nov 19, 2025Updated 3 months ago
LatentActionPretraining / LAPA
View on GitHub
[ICLR 2025] LAPA: Latent Action Pretraining from Videos
☆468Jan 22, 2025Updated last year
nickgkan / 3d_diffuser_actor
View on GitHub
Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"
☆384Aug 17, 2024Updated last year
openvla / openvla
View on GitHub
OpenVLA: An open-source vision-language-action model for robotic manipulation.
☆5,317Mar 23, 2025Updated 11 months ago
OpenMOSS / VLABench
View on GitHub
Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.
☆387Nov 11, 2025Updated 3 months ago
Stanford-ILIAD / openvla-mini
View on GitHub
OpenVLA: An open-source vision-language-action model for robotic manipulation.
☆343Mar 19, 2025Updated 11 months ago
juruobenruo / DexVLA
View on GitHub
☆41Apr 15, 2025Updated 10 months ago
OpenHelix-Team / OpenHelix
View on GitHub
OpenHelix: An Open-source Dual-System VLA Model for Robotic Manipulation
☆346Aug 27, 2025Updated 6 months ago
TencentARC / Moto
View on GitHub
[ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos
☆163Oct 1, 2025Updated 5 months ago
RoboFlamingo / RoboFlamingo
View on GitHub
Code for RoboFlamingo
☆424May 8, 2024Updated last year
octo-models / octo
View on GitHub
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.
☆1,552Jul 31, 2024Updated last year
intuitive-robots / mdt_policy
View on GitHub
[RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre…
☆168Oct 16, 2024Updated last year
Nicolinho / RoboVLM
View on GitHub
☆47May 13, 2024Updated last year
jayLEE0301 / vq_bet_official
View on GitHub
Official code for "Behavior Generation with Latent Actions" (ICML 2024 Spotlight)
☆197Feb 28, 2024Updated 2 years ago
OpenDriveLab / AgiBot-World
View on GitHub
[IROS 2025 Best Paper Award Finalist & IEEE TRO 2026] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems
☆2,789Dec 16, 2025Updated 2 months ago
UMass-Embodied-AGI / 3D-VLA
View on GitHub
[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model
☆622Oct 29, 2024Updated last year
RoboVerseOrg / RoboVerse
View on GitHub
RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
☆1,670Updated this week
YanjieZe / 3D-Diffusion-Policy
View on GitHub
[RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations
☆1,262Oct 17, 2025Updated 4 months ago
EDiRobotics / GR1-Training
View on GitHub
Reimplementation of GR-1, a generalized policy for robotics manipulation.
☆147Sep 4, 2024Updated last year
EDiRobotics / mimictest
View on GitHub
A simple testbed for robotics manipulation policies
☆103Apr 13, 2025Updated 10 months ago
Physical-Intelligence / openpi
View on GitHub
☆10,349Dec 27, 2025Updated 2 months ago
Lifelong-Robot-Learning / LIBERO
View on GitHub
Benchmarking Knowledge Transfer in Lifelong Robot Learning
☆1,517Mar 15, 2025Updated 11 months ago
liruiw / HPT
View on GitHub
Heterogeneous Pre-trained Transformer (HPT) as Scalable Policy Learner.
☆529Dec 6, 2024Updated last year
Tavish9 / any4lerobot
View on GitHub
🎁 A collection of utilities for LeRobot.
☆873Feb 7, 2026Updated 3 weeks ago
mlzxy / arp
View on GitHub
Autoregressive Policy for Robot Learning (RA-L 2025)
☆147Mar 25, 2025Updated 11 months ago
InternRobotics / InternUtopia
View on GitHub
A simulation platform for versatile Embodied AI research and developments.
☆1,209Sep 4, 2025Updated 5 months ago
MohitShridhar / genima
View on GitHub
Official Code Repo for GENIMA
☆77Oct 29, 2025Updated 4 months ago
haosulab / ManiSkill
View on GitHub
SAPIEN Manipulation Skill Framework, an open source GPU parallelized robotics simulator and benchmark, led by Hillbot, Inc.
☆2,595Jan 31, 2026Updated last month