Stanford-ILIAD / openvla-mini
OpenVLA: An open-source vision-language-action model for robotic manipulation.
☆154Updated 2 weeks ago
Alternatives and similar repositories for openvla-mini:
Users that are interested in openvla-mini are comparing it to the libraries listed below
- [RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre…☆127Updated 5 months ago
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos☆216Updated 2 months ago
- [CVPR 2025] The offical Implementation of "Universal Actions for Enhanced Embodied Foundation Models"☆121Updated 2 weeks ago
- Autoregressive Policy for Robot Learning (RA-L 2025)☆105Updated last week
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization☆101Updated 2 weeks ago
- Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success☆270Updated last week
- ☆62Updated last month
- ☆321Updated 2 months ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆243Updated 11 months ago
- Embodied Chain of Thought: A robotic policy that reason to solve the task.☆200Updated 3 weeks ago
- Unified Video Action Model☆137Updated 2 weeks ago
- ☆169Updated last year
- A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation☆222Updated last month
- Official implementation of "Data Scaling Laws in Imitation Learning for Robotic Manipulation"☆157Updated 4 months ago
- Official code for "Behavior Generation with Latent Actions" (ICML 2024 Spotlight)☆156Updated last year
- The official repo for the paper "In-Context Imitation Learning via Next-Token Prediction"☆69Updated 3 weeks ago
- Code for subgoal synthesis via image editing☆132Updated last year
- 🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes.☆192Updated 2 weeks ago
- Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"☆297Updated 7 months ago
- HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model☆152Updated last week
- ☆79Updated 3 weeks ago
- DROID Policy Learning and Evaluation☆175Updated 3 months ago
- 🔥[ICLR'25] LLaRA: Supercharging Robot Learning Data for Vision-Language Policy☆203Updated last week
- A Benchmark for Low-Level Manipulation in Home Rearrangement Tasks☆101Updated 2 weeks ago
- A unified architecture for multimodal multi-task robotic policy learning.☆140Updated last year
- Reimplementation of GR-1, a generalized policy for robotics manipulation.☆125Updated 7 months ago
- Official codebase for "Any-point Trajectory Modeling for Policy Learning"☆211Updated 7 months ago
- ☆239Updated 7 months ago
- A simple testbed for robotics manipulation policies☆81Updated last month
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation☆135Updated 2 weeks ago