CladernyJorn/VLM4VLA


CladernyJorn / VLM4VLA

Implementation of VLM4VLA

☆165

Alternatives and similar repositories for VLM4VLA

Users who are interested in VLM4VLA are comparing it to the repositories listed below.


ZGC-EmbodyAI / TwinBrainVLA
☆29 · May 22, 2026 · Updated 2 months ago
starVLA / starVLA
StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing
☆3,278 · Updated this week
Robert-gyj / Ctrl-World
ICLR 2026 Paper: Ctrl-World
☆539 · Apr 8, 2026 · Updated 3 months ago
WM-PO / WMPO
Official Implementation of Paper: WMPO: World Model-based Policy Optimization for Vision-Language-Action Models
☆227 · Jan 4, 2026 · Updated 6 months ago
Spirit-AI-Team / spirit-v1.5
Spirit-v1.5: A Robotic Foundation Model by Spirit AI
☆625 · May 29, 2026 · Updated last month
NVlabs / cosmos-policy
Cosmos Policy
☆837 · Jan 23, 2026 · Updated 6 months ago
allenai / molmospaces
An end-to-end open ecosystem for robot learning
☆420 · Updated this week
dreamzero0 / dreamzero
Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals
☆2,482 · Apr 19, 2026 · Updated 3 months ago
aimbot-reticle / openpi0-aimbot
CoRL25-"AimBot: A Simple Auxiliary Visual Cue to Enhance Spatial Awareness of Visuomotor Policies"
☆50 · Aug 15, 2025 · Updated 11 months ago
NVlabs / vla0
VLA-0: Building State-of-the-Art VLAs with Zero Modification
☆488 · Feb 21, 2026 · Updated 5 months ago
steerable-policies / steerable-policies-bridge
☆40 · Feb 16, 2026 · Updated 5 months ago
yuantianyuan01 / FastWAM
Official codebase for Fast-WAM: Do World Action Models Need Test-time Future Imagination?
☆1,209 · Apr 3, 2026 · Updated 3 months ago
Chaoqi-LIU / oat
[RSS 2026] Ordered Action Tokenization
☆101 · Feb 5, 2026 · Updated 5 months ago
gen-robot / RL4VLA
☆277 · Aug 25, 2025 · Updated 10 months ago
Robbyant / lingbot-va
[RSS 2026] Causal video-action world model for generalist robot control
☆1,667 · Jul 9, 2026 · Updated 2 weeks ago
WangYixuan12 / interactive_world_sim
[RSS 2026] Interactive World Simulator for Robot Policy Training and Evaluation
☆277 · Jun 4, 2026 · Updated last month
InternRobotics / InternVLA-M1
InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy
☆418 · Feb 11, 2026 · Updated 5 months ago
PRIME-RL / SimpleVLA-RL
[ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
☆1,794 · Jan 6, 2026 · Updated 6 months ago
roboterax / video-prediction-policy
Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations https://video-prediction-policy.github.io
☆408 · May 17, 2025 · Updated last year
THU-SI / Spatial-MLLM
[NeurIPS 2025 Spotlight] Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence
☆480 · Feb 5, 2026 · Updated 5 months ago
alibaba-damo-academy / RynnVLA-002
RynnVLA-002: A Unified Vision-Language-Action and World Model
☆1,100 · Dec 2, 2025 · Updated 7 months ago
EmbodiedBench / EmbodiedBench
[ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.
☆318 · May 30, 2026 · Updated last month
cover-vla / cover-vla
This is the official codebase for paper: Scaling Verification Can Be More Effective than Scaling Policy Learning for Vision-Language-Acti…
☆59 · Jul 11, 2026 · Updated last week
buoyancy99 / large-video-planner
☆256 · Jan 31, 2026 · Updated 5 months ago
spacetools / SpaceTools
code release
☆38 · Jun 22, 2026 · Updated last month
lzylucy / 4dgen
[ICLR 2026] Codebase for paper "Geometry-aware 4D Video Generation for Robot Manipulation"
☆123 · Jan 10, 2026 · Updated 6 months ago
Robbyant / lingbot-vla
A Pragmatic VLA Foundation Model
☆1,665 · Jun 11, 2026 · Updated last month
horipse01 / 3d-foundation-policy
☆113 · Jun 2, 2026 · Updated last month
lihzha / lap
LAP: Language-Action Pre-Training Enables Zero-Shot Cross Embodiment Transfer
☆162 · May 20, 2026 · Updated 2 months ago
ToruOwO / how-to-peel
🔪 How to Peel with a Knife: Aligning Fine-Grained Manipulation with Human Preference
☆23 · Jun 3, 2026 · Updated last month
hhcaz / e2vla
☆25 · Oct 18, 2025 · Updated 9 months ago
NVIDIA / GR00T-Dreams
DreamGen: Nvidia GEAR Lab's initiative to solve the robotics data problem using world models
☆591 · Oct 24, 2025 · Updated 9 months ago
SpatialVLA / SpatialVLA
🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.
☆709 · Jun 23, 2025 · Updated last year
moojink / openvla-oft
Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success
☆1,311 · Sep 9, 2025 · Updated 10 months ago
allenai / molmoact
Official Repository for MolmoAct
☆376 · May 11, 2026 · Updated 2 months ago
umd-huang-lab / tracevla
☆75 · Jan 8, 2025 · Updated last year
simpler-env / SimplerEnv
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…
☆1,128 · Dec 20, 2025 · Updated 7 months ago
sii-research / tau-0-wm
☆267 · Jul 2, 2026 · Updated 3 weeks ago
ShuangLI59 / unified_video_action
Official PyTorch Implementation of Unified Video Action Model (RSS 2025)
☆400 · Jul 23, 2025 · Updated last year