PKU-Alignment / VLA-ArenaLinks
VLA-Arena is an open-source benchmark for systematic evaluation of Vision-Language-Action (VLA) models.
☆41Updated last month
Alternatives and similar repositories for VLA-Arena
Users that are interested in VLA-Arena are comparing it to the libraries listed below
Sorting:
- Paper list in the survey: A Survey on Vision-Language-Action Models: An Action Tokenization Perspective☆323Updated 4 months ago
- Dexbotic: Open-Source Vision-Language-Action Toolbox☆467Updated 2 weeks ago
- SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning☆987Updated last month
- RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforce…☆1,310Updated this week
- ☆59Updated 7 months ago
- RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation☆250Updated 3 weeks ago
- Building General-Purpose Robots Based on Embodied Foundation Model☆592Updated this week
- Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks☆179Updated last month
- OpenHelix: An Open-source Dual-System VLA Model for Robotic Manipulation☆323Updated 2 months ago
- 🔥This is a curated list of "A survey on Efficient Vision-Language Action Models" research. We will continue to maintain and update the r…☆75Updated last week
- Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution"☆117Updated 9 months ago
- WorldVLA: Towards Autoregressive Action World Model☆539Updated last month
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆181Updated last month
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.☆317Updated 2 months ago
- ☆198Updated 2 months ago
- HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model☆318Updated last month
- LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [Actively Maintained🔥]☆172Updated 3 weeks ago
- Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations https://video-prediction-policy.github.io☆295Updated 6 months ago
- Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.☆332Updated last week
- [ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.☆214Updated last month
- Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning.☆338Updated last week
- StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing☆433Updated last week
- This repository summarizes recent advances in the VLA + RL paradigm and provides a taxonomic classification of relevant works.☆328Updated last month
- Evo-1: Lightweight Vision-Language-Action Model with Preserved Semantic Alignment☆77Updated this week
- It's not a list of papers, but a list of paper reading lists...☆232Updated 6 months ago
- 🦾 A Dual-System VLA with System2 Thinking☆115Updated 3 months ago
- A curated list of large VLM-based VLA models for robotic manipulation.☆250Updated last week
- Running VLA at 30Hz frame rate and 480Hz trajectory frequency☆242Updated last week
- [ICCV 2025] RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints☆92Updated 2 months ago
- ☆358Updated 3 weeks ago