DelinQu / awesome-vision-language-action-modelLinks

Latest Advances on Vison-Language-Action Models.

☆119

Alternatives and similar repositories for awesome-vision-language-action-model

Users that are interested in awesome-vision-language-action-model are comparing it to the libraries listed below

Sorting:

PKU-HMI-Lab / Hybrid-VLA
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model
☆322Updated 2 months ago
MichalZawalski / embodied-CoT
Embodied Chain of Thought: A robotic policy that reason to solve the task.
☆332Updated 8 months ago
microsoft / CogACT
A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation
☆383Updated last month
Stanford-ILIAD / openvla-mini
OpenVLA: An open-source vision-language-action model for robotic manipulation.
☆300Updated 8 months ago
JiuTian-VL / Large-VLM-based-VLA-for-Robotic-Manipulation
A curated list of large VLM-based VLA models for robotic manipulation.
☆269Updated 2 weeks ago
yueen-ma / Awesome-VLA
☆374Updated last month
OpenMOSS / VLABench
Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.
☆336Updated 3 weeks ago
GuanxingLu / vlarl
Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning.
☆360Updated 3 weeks ago
SpatialVLA / SpatialVLA
🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.
☆578Updated 5 months ago
Robot-VLAs / RoboVLMs
☆411Updated last week
OpenHelix-Team / OpenHelix
OpenHelix: An Open-source Dual-System VLA Model for Robotic Manipulation
☆328Updated 3 months ago
LatentActionPretraining / LAPA
[ICLR 2025] LAPA: Latent Action Pretraining from Videos
☆414Updated 10 months ago
yueyang130 / DeeR-VLA
Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution"
☆119Updated 9 months ago
InternRobotics / InternVLA-M1
InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy
☆296Updated 3 weeks ago
starVLA / starVLA
StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing
☆513Updated 2 weeks ago
NVlabs / vla0
VLA-0: Building State-of-the-Art VLAs with Zero Modification
☆325Updated 2 weeks ago
Fanqi-Lin / OneTwoVLA
Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"
☆200Updated 6 months ago
aiming-lab / GRAPE
GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization
☆151Updated 8 months ago
InternRobotics / VLAC
VLAC: A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning
☆237Updated 2 months ago
Psi-Robot / Awesome-VLA-Papers
Paper list in the survey: A Survey on Vision-Language-Action Models: An Action Tokenization Perspective
☆341Updated 5 months ago
baaivision / UniVLA
Unified Vision-Language-Action Model
☆245Updated last month
Zhoues / RoboRefer
[NeurIPS 2025] Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"
☆205Updated last month
2toinf / UniAct
[CVPR 2025] The offical Implementation of "Universal Actions for Enhanced Embodied Foundation Models"
☆214Updated last month
OpenGalaxea / G0
Galaxea's first VLA release
☆317Updated last month
embodiedreasoning / ERQA
Embodied Reasoning Question Answer (ERQA) Benchmark
☆245Updated 8 months ago
bytedance / GR-1
Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"
☆290Updated last year
lmzpai / roboMamba
The repo of paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation`
☆140Updated 11 months ago
JayceWen / tinyvla
☆67Updated 9 months ago
OpenHelix-robot / awesome-dual-system-vla
A comprehensive list of papers about dual-system VLA models, including papers, codes, and related websites.
☆86Updated 2 weeks ago
HeegerGao / VLA-OS
Official Code For VLA-OS.
☆128Updated 5 months ago