One framework to evaluate any VLA model on any robot simulation benchmark.
☆336Jun 1, 2026Updated last week
Alternatives and similar repositories for vla-evaluation-harness
Users that are interested in vla-evaluation-harness are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- VLA model interpretability tools☆172Mar 30, 2026Updated 2 months ago
- code release☆32May 7, 2026Updated last month
- [ICML 2025] The Official Implementation of "Efficient Robotic Policy Learning via Latent Space Backward Planning"☆30Dec 15, 2025Updated 5 months ago
- InternDataEngine: Pioneering High-Fidelity Synthetic Data Generator for Robotic Manipulation☆111Mar 20, 2026Updated 2 months ago
- Official repo for "StreamingVLA: Streaming Vision-Language-Action Model with Action Flow Matching and Adaptive Early Observation"☆25Apr 22, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Isaac Lab repository for LEAP Hand V1☆37Sep 4, 2025Updated 9 months ago
- [ICLR 2026] - One2Scene☆42May 25, 2026Updated 2 weeks ago
- ☆133Apr 29, 2026Updated last month
- CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction [ICRA 2025]☆18Oct 20, 2025Updated 7 months ago
- Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning☆24Jun 25, 2025Updated 11 months ago
- [RA-L with ICRA2023] TransDSSL: Transformer based Depth Estimation via Self-Supervised Learning☆12Jan 11, 2023Updated 3 years ago
- Open-source code of the paper: Real-to-Sim Robot Policy Evaluation with Gaussian Splatting Simulation of Soft-Body Interactions.☆187Nov 11, 2025Updated 6 months ago
- Benchmarking Knowledge Transfer in Lifelong Robot Learning☆1,907Mar 15, 2025Updated last year
- ☆91Apr 14, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This is the official codebase for paper: Scaling Verification Can Be More Effective than Scaling Policy Learning for Vision-Language-Acti…☆54Apr 9, 2026Updated 2 months ago
- AutoEval: Autonomous Evaluation of Generalist Robot Manipulation Policies in the Real World | CoRL 2025☆97Mar 26, 2026Updated 2 months ago
- RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots☆1,434May 21, 2026Updated 2 weeks ago
- Actuated Version of the Universal Manipulation Interface Gripper☆30May 6, 2025Updated last year
- URDF/xacro description files for the OpenArm robot system☆35Updated this week
- LAP: Language-Action Pre-Training Enables Zero-Shot Cross Embodiment Transfer☆142May 20, 2026Updated 2 weeks ago
- Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.☆438Nov 11, 2025Updated 6 months ago
- download all oral & spotlight papers from neurips, iclr, icml or any openreview conference☆28Apr 26, 2026Updated last month
- ☆114Oct 27, 2025Updated 7 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- world modeling challenge for humanoid robots☆557Nov 8, 2024Updated last year
- RoboFAC: A Comprehensive Framework for Robotic Failure Analysis and Correction☆32Dec 10, 2025Updated 5 months ago
- Video-Action Models for Generalizable Robot Control Beyond VLAs☆263May 10, 2026Updated 3 weeks ago
- [CVPR 2026] WorldStereo: Bridging Camera-Guided Video Generation and Scene Reconstruction via 3D Geometric Memories (WorldExpand of HY-Wo…☆164Apr 24, 2026Updated last month
- A real2sim evaluation framework for generalist policies☆195Mar 23, 2026Updated 2 months ago
- Open data set for Robotics Lab, Institute of Cyber-Systems and Control Zhejiang University☆15Oct 26, 2020Updated 5 years ago
- Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)☆16Oct 23, 2021Updated 4 years ago
- ☆61Apr 15, 2025Updated last year
- Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success☆1,240Sep 9, 2025Updated 9 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge…☆11Jul 28, 2025Updated 10 months ago
- InternVLA-A1: Unifying Understanding, Generation, and Action for Robotic Manipulation☆411Feb 27, 2026Updated 3 months ago
- ☆17Dec 5, 2024Updated last year
- KitchenShift: Evaluating Zero-Shot Generalization of Imitation-Based Policy Learning Under Domain Shifts☆19Jun 21, 2022Updated 3 years ago
- Implementations of path planning algorithms☆15Apr 4, 2021Updated 5 years ago
- ENACT is a benchmark that evaluates embodied cognition through world modeling from egocentric interaction. It is designed to be simple an…☆51Nov 27, 2025Updated 6 months ago
- KUDA: Keypoints to Unify Dynamics Learning and Visual Prompting for Open-Vocabulary Robotic Manipulation☆22Apr 23, 2025Updated last year