LostXine / LLaRA
🔥[ICLR'25] LLaRA: Supercharging Robot Learning Data for Vision-Language Policy
☆197Updated last week
Alternatives and similar repositories for LLaRA:
Users that are interested in LLaRA are comparing it to the libraries listed below
- Embodied Chain of Thought: A robotic policy that reason to solve the task.☆189Updated 2 weeks ago
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos☆199Updated 2 months ago
- ☆46Updated 3 months ago
- Theia: Distilling Diverse Vision Foundation Models for Robot Learning☆220Updated 5 months ago
- Reimplementation of GR-1, a generalized policy for robotics manipulation.☆124Updated 6 months ago
- A Vision-Language Model for Spatial Affordance Prediction in Robotics☆138Updated 3 weeks ago
- Embodied Reasoning Question Answer (ERQA) Benchmark☆95Updated 2 weeks ago
- Code for subgoal synthesis via image editing☆130Updated last year
- Unified Video Action Model☆123Updated last week
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization☆95Updated this week
- Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World☆127Updated 5 months ago
- Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"☆200Updated 3 weeks ago
- Official repository of Learning to Act from Actionless Videos through Dense Correspondences.☆204Updated 11 months ago
- ☆162Updated last year
- OpenVLA: An open-source vision-language-action model for robotic manipulation.☆145Updated last week
- A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation☆205Updated last month
- Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success☆215Updated 2 weeks ago
- ☆56Updated last week
- Official implementation of GR-MG☆76Updated 2 months ago
- DROID Policy Learning and Evaluation☆175Updated 3 months ago
- A Benchmark for Low-Level Manipulation in Home Rearrangement Tasks☆96Updated last week
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆238Updated 11 months ago
- The official repo for the paper "In-Context Imitation Learning via Next-Token Prediction"☆69Updated last week
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation☆120Updated this week
- [ICCV 2023] Official code repository for ARNOLD benchmark☆157Updated last week
- [CoRL 2023] REFLECT: Summarizing Robot Experiences for Failure Explanation and Correction☆89Updated last year
- Autoregressive Policy for Robot Learning (RA-L 2025)☆103Updated 2 weeks ago
- [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model`☆109Updated 5 months ago
- ☆62Updated last month
- ☆63Updated 5 months ago