OpenHelix-Team / LLaVA-VLALinks

LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [Actively Maintained🔥]

☆158

Alternatives and similar repositories for LLaVA-VLA

Users that are interested in LLaVA-VLA are comparing it to the libraries listed below

Sorting:

InternRobotics / InternVLA-M1
InternVLA-M1: A Spatially Grounded Foundation Model for Generalist Robot Policy
☆125Updated last week
PKU-HMI-Lab / Hybrid-VLA
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model
☆289Updated last week
Zhangwenyao1 / DreamVLA
[NeurIPS2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge
☆188Updated 3 weeks ago
baaivision / UniVLA
Unified Vision-Language-Action Model
☆203Updated 2 months ago
EmbodiedCity / Embodied-R.code
☆83Updated 4 months ago
Zhoues / RoboRefer
[NeurIPS 2025] Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"
☆166Updated last week
alibaba-damo-academy / WorldVLA
WorldVLA: Towards Autoregressive Action World Model
☆435Updated last month
RoboDita / Dita
ICCV2025
☆135Updated last month
qizekun / SoFar
[NeurIPS 2025 Spotlight] SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
☆192Updated 3 months ago
JiuTian-VL / Large-VLM-based-VLA-for-Robotic-Manipulation
A curated list of large VLM-based VLA models for robotic manipulation.
☆196Updated last week
pickxiguapi / Embodied-R1
Official code for "Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation"
☆83Updated last month
BAAI-DCAI / SpatialBot
The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.
☆310Updated 3 weeks ago
hume-vla / hume
🦾 A Dual-System VLA with System2 Thinking
☆112Updated last month
GuanxingLu / vlarl
Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning.
☆300Updated 3 weeks ago
InternRobotics / F1-VLA
F1: A Vision Language Action Model Bridging Understanding and Generation to Actions
☆107Updated last month
HeegerGao / VLA-OS
Official Code For VLA-OS.
☆111Updated 3 months ago
Psi-Robot / Awesome-VLA-Papers
Paper list in the survey: A Survey on Vision-Language-Action Models: An Action Tokenization Perspective
☆271Updated 3 months ago
DelinQu / awesome-vision-language-action-model
Latest Advances on Vison-Language-Action Models.
☆112Updated 7 months ago
InternRobotics / StreamVLN
Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling"
☆240Updated last week
lmzpai / roboMamba
The repo of paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation`
☆138Updated 9 months ago
liufanfanlff / RoboUniview
☆57Updated 7 months ago
yueyang130 / DeeR-VLA
Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution"
☆111Updated 7 months ago
SpatialVLA / SpatialVLA
🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.
☆514Updated 3 months ago
Fanqi-Lin / OneTwoVLA
Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"
☆187Updated 4 months ago
OpenDriveLab / CLOVER
[NeurIPS 2024] CLOVER: Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation
☆128Updated last month
WayneMao / RoboMatrix
The Official Implementation of RoboMatrix
☆97Updated 4 months ago
PKU-HMI-Lab / LIFT3D
[CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation
☆161Updated 3 months ago
Koorye / Inspire
Official implemetation of the paper "InSpire: Vision-Language-Action Models with Intrinsic Spatial Reasoning"
☆45Updated last week
InternRobotics / InternSR
InternRobotics' open-source toolbox for vision-based embodied spatial intelligence.
☆41Updated 3 weeks ago
InternRobotics / InternNav
InternRobotics' open platform for building generalized navigation foundation models.
☆323Updated this week