allenai/vla-evaluation-harness

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/allenai/vla-evaluation-harness)

allenai / vla-evaluation-harness

One framework to evaluate any VLA model on any robot simulation benchmark.

☆456

Alternatives and similar repositories for vla-evaluation-harness

Users that are interested in vla-evaluation-harness are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

starVLA / starVLA
View on GitHub
StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing
☆3,251Updated this week
NVlabs / RoboLab
View on GitHub
☆384Updated this week
simpler-env / SimplerEnv
View on GitHub
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…
☆1,126Dec 20, 2025Updated 7 months ago
NVlabs / cosmos-policy
View on GitHub
Cosmos Policy
☆835Jan 23, 2026Updated 5 months ago
dreamzero0 / dreamzero
View on GitHub
Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals
☆2,471Apr 19, 2026Updated 3 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
sylvestf / LIBERO-plus
View on GitHub
Official repository of LIBERO-plus, a generalized benchmark for in-depth robustness analysis of vision-language-action models.
☆386Jan 21, 2026Updated 6 months ago
yuantianyuan01 / FastWAM
View on GitHub
Official codebase for Fast-WAM: Do World Action Models Need Test-time Future Imagination?
☆1,196Apr 3, 2026Updated 3 months ago
OpenMOSS / VLABench
View on GitHub
Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.
☆452Nov 11, 2025Updated 8 months ago
allenai / molmospaces
View on GitHub
An end-to-end open ecosystem for robot learning
☆415Updated this week
TRI-ML / vla_foundry
View on GitHub
☆418Jun 15, 2026Updated last month
Lifelong-Robot-Learning / LIBERO
View on GitHub
Benchmarking Knowledge Transfer in Lifelong Robot Learning
☆2,084Mar 15, 2025Updated last year
Tavish9 / any4lerobot
View on GitHub
🎁 A collection of utilities for LeRobot.
☆1,109Jul 2, 2026Updated 2 weeks ago
dexmal / dexbotic
View on GitHub
Dexbotic: Open-Source Vision-Language-Action Toolbox
☆1,257Jun 25, 2026Updated 3 weeks ago
Robbyant / lingbot-va
View on GitHub
[RSS 2026] Causal video-action world model for generalist robot control
☆1,645Jul 9, 2026Updated last week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
amazon-far / abc
View on GitHub
ABC: Scalable Behavior Cloning with Open Data, Training, and Evaluation
☆272Updated this week
NVlabs / vla0
View on GitHub
VLA-0: Building State-of-the-Art VLAs with Zero Modification
☆488Feb 21, 2026Updated 5 months ago
lihzha / lap
View on GitHub
LAP: Language-Action Pre-Training Enables Zero-Shot Cross Embodiment Transfer
☆160May 20, 2026Updated 2 months ago
2toinf / X-VLA
View on GitHub
[ICLR 2026] The offical Implementation of "Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model"
☆693Jun 10, 2026Updated last month
robocasa / robocasa
View on GitHub
RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots
☆1,558Jul 8, 2026Updated last week
capgym / cap-x
View on GitHub
A Framework for Benchmarking and Improving Coding Agents for Robot Manipulation
☆651May 28, 2026Updated last month
moojink / openvla-oft
View on GitHub
Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success
☆1,303Sep 9, 2025Updated 10 months ago
RLinf / RLinf
View on GitHub
RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI
☆4,179Updated this week
Zxy-MLlab / LIBERO-PRO
View on GitHub
LIBERO-PRO is the official repository of the LIBERO-PRO — an evaluation extension of the original LIBERO benchmark
☆284Updated this week
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
X-Square-Robot / wall-x
View on GitHub
Building General-Purpose Robots Based on Embodied Foundation Model
☆1,180Updated this week
Physical-Intelligence / openpi
View on GitHub
☆12,909Jun 16, 2026Updated last month
PRIME-RL / SimpleVLA-RL
View on GitHub
[ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
☆1,787Jan 6, 2026Updated 6 months ago
OpenGalaxea / GalaxeaVLA
View on GitHub
Galaxea's open-source VLA repository
☆689Jul 11, 2026Updated last week
LUOyk1999 / SimVLA
View on GitHub
Implementation of "SimVLA: A Simple VLA Baseline for Robotic Manipulation"
☆141Feb 26, 2026Updated 4 months ago
StanfordVL / BEHAVIOR-1K
View on GitHub
BEHAVIOR-1K: a platform for accelerating Embodied AI research. Join our Discord for support: https://discord.gg/bccR5vGFEx
☆1,585Updated this week
arhanjain / polaris
View on GitHub
A real2sim evaluation framework for generalist policies
☆221Jul 13, 2026Updated last week
arhanjain / sim-evals
View on GitHub
A simulation evaluation platform for DROID
☆234Mar 16, 2026Updated 4 months ago
isaac-sim / IsaacLab-Arena
View on GitHub
Isaac Lab - Arena is a robotics simulation framework that enhances NVIDIA Isaac Lab by providing a composable, scalable system for creati…
☆491Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
allenai / molmoact
View on GitHub
Official Repository for MolmoAct
☆376May 11, 2026Updated 2 months ago
Spirit-AI-Team / spirit-v1.5
View on GitHub
Spirit-v1.5: A Robotic Foundation Model by Spirit AI
☆625May 29, 2026Updated last month
OpenDriveLab / UniVLA
View on GitHub
[RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions
☆1,110Nov 19, 2025Updated 8 months ago
IliaLarchenko / behavior-1k-solution
View on GitHub
1st place solution of 2025 BEHAVIOR Challenge
☆310Jan 24, 2026Updated 5 months ago
RoboVerseOrg / RoboVerse
View on GitHub
RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
☆1,786Updated this week
bjrobotnewbie / VLAExplain
View on GitHub
VLA model interpretability tools
☆175Mar 30, 2026Updated 3 months ago
thu-ml / Motus
View on GitHub
Official code of Motus: A Unified Latent Action World Model
☆1,211Jan 5, 2026Updated 6 months ago