robo-arena / roboarena
Distributed, scalable benchmarking of generalist robot policies.
☆35Updated 3 weeks ago
Alternatives and similar repositories for roboarena
Users who are interested in roboarena are comparing it to the libraries listed below:
- ☆48Updated 6 months ago
- (RA-L 2025) VILP: Imitation Learning with Latent Video Planning☆19Updated 3 weeks ago
- [ICLR 2025🎉] This is the official implementation of paper "Robots Pre-Train Robots: Manipulation-Centric Robotic Representation from Lar…☆77Updated 5 months ago
- ☆37Updated 3 months ago
- Repo for Bring Your Own Vision-Language-Action (VLA) model, arxiv 2024☆30Updated 5 months ago
- ☆19Updated 5 months ago
- ☆53Updated 6 months ago
- Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets☆91Updated last month
- ☆75Updated last month
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆67Updated 7 months ago
- ☆28Updated 2 months ago
- Official implementation for our paper "Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance" (CoRL 2024)☆28Updated 2 months ago
- ☆75Updated 10 months ago
- ☆19Updated last month
- ☆68Updated 8 months ago
- Code release for paper "Autonomous Improvement of Instruction Following Skills via Foundation Models" | CoRL 2024☆72Updated 6 months ago
- PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability☆18Updated 3 months ago
- Code for the paper "Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation"☆88Updated 11 months ago
- [ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction☆87Updated 3 months ago
- Code for "Steerable Scene Generation with Post Training and Inference-Time Search"☆40Updated last month
- Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)☆48Updated last month
- UniSkill: Imitating Human Videos via Cross-Embodiment Skill Representations☆50Updated last month
- [ICCV 2025] VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers☆48Updated 2 weeks ago
- [CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation☆77Updated last month
- [ICRA 2025] In-Context Imitation Learning via Next-Token Prediction☆84Updated 3 months ago
- Streaming Diffusion Policy: Fast Policy Synthesis with Variable Noise Diffusion Models☆63Updated 2 months ago
- ☆49Updated 4 months ago
- Interactive Post-Training for Vision-Language-Action Models☆91Updated last month
- ✨✨Official implementation of BridgeVLA☆95Updated 2 weeks ago
- A Vision-Language-Model for Detecting and Reasoning Over Failures in Robotic Manipulation☆33Updated 3 months ago