One framework to evaluate any VLA model on any robot simulation benchmark.
☆386Jun 23, 2026Updated this week
Alternatives and similar repositories for vla-evaluation-harness
Users that are interested in vla-evaluation-harness are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICML 2025] The Official Implementation of "Efficient Robotic Policy Learning via Latent Space Backward Planning"☆30Dec 15, 2025Updated 6 months ago
- VLA model interpretability tools☆174Mar 30, 2026Updated 2 months ago
- code release☆35Jun 22, 2026Updated last week
- AutoEval: Autonomous Evaluation of Generalist Robot Manipulation Policies in the Real World | CoRL 2025☆97Mar 26, 2026Updated 3 months ago
- [NeurIPS 2025] Official code repository for "Failure Prediction at Runtime for Generative Robot Policies".☆45Nov 3, 2025Updated 7 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- InternDataEngine: Pioneering High-Fidelity Synthetic Data Generator for Robotic Manipulation☆116Mar 20, 2026Updated 3 months ago
- Official repo for "StreamingVLA: Streaming Vision-Language-Action Model with Action Flow Matching and Adaptive Early Observation"☆27Apr 22, 2026Updated 2 months ago
- ☆73Jun 18, 2026Updated last week
- [ICLR 2026] - One2Scene☆47May 25, 2026Updated last month
- ☆142Apr 29, 2026Updated 2 months ago
- A curated list of papers and selected technical blogs on Loop Models.☆177Jun 22, 2026Updated last week
- CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction [ICRA 2025]☆18Oct 20, 2025Updated 8 months ago
- Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning☆25Jun 25, 2025Updated last year
- Open-source code of the paper: Real-to-Sim Robot Policy Evaluation with Gaussian Splatting Simulation of Soft-Body Interactions.☆191Nov 11, 2025Updated 7 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official Repository of “MotionTrans: Human VR Data Enable Motion-Level Learning for Robotic Manipulation Policies"☆59Mar 9, 2026Updated 3 months ago
- RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots☆1,479Jun 19, 2026Updated last week
- ☆41Aug 27, 2024Updated last year
- Benchmarking Knowledge Transfer in Lifelong Robot Learning☆1,983Mar 15, 2025Updated last year
- A Benchmark for Evaluating Generalization for Robotic Manipulation☆149Mar 3, 2025Updated last year
- The repo contains the code and dataset for the World Models Track of GigaBrain Challenge 2026 CVPR Workshop.☆59Apr 8, 2026Updated 2 months ago
- Actuated Version of the Universal Manipulation Interface Gripper☆30May 6, 2025Updated last year
- A simulation evaluation platform for DROID☆222Mar 16, 2026Updated 3 months ago
- Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.☆443Nov 11, 2025Updated 7 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- download all oral & spotlight papers from neurips, iclr, icml or any openreview conference☆28Apr 26, 2026Updated 2 months ago
- 🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.☆697Jun 23, 2025Updated last year
- Evo-1: Lightweight Vision-Language-Action Model with Preserved Semantic Alignment☆297May 12, 2026Updated last month
- A sophisticated multi-agent system designed for real-time market analysis of HTX (formerly Huobi) exchange data. This swarm combines spec…☆10Mar 18, 2025Updated last year
- ☆116Oct 27, 2025Updated 8 months ago
- Embodied Chain of Thought: A robotic policy that reason to solve the task.☆403Apr 5, 2025Updated last year
- world modeling challenge for humanoid robots☆559Nov 8, 2024Updated last year
- RoboFAC: A Comprehensive Framework for Robotic Failure Analysis and Correction☆35Dec 10, 2025Updated 6 months ago
- Simple Recipe Works: Vision-Language-Action Models are Natural Continual Learners with Reinforcement Learning☆60Mar 16, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A web UI interface on top of lerobot☆22Apr 29, 2026Updated 2 months ago
- Repository for learning a policy to get a humanoid robot to stand up☆19Feb 13, 2025Updated last year
- ☆11Jan 8, 2025Updated last year
- Mixed complementarity problems parameterized by "runtime"-parameters with support for implicit differentiation.☆20Updated this week
- [CVPR 2026] WorldStereo: Bridging Camera-Guided Video Generation and Scene Reconstruction via 3D Geometric Memories (WorldExpand of HY-Wo…☆167Apr 24, 2026Updated 2 months ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆57Jul 3, 2024Updated last year
- A real2sim evaluation framework for generalist policies☆201Mar 23, 2026Updated 3 months ago