LatentActionPretraining / LAPA
LAPA: Latent Action Pretraining from Videos
☆136 · Updated 3 weeks ago
Alternatives and similar repositories for LAPA
Users who are interested in LAPA are comparing it to the repositories listed below:
- Official repository of "Learning to Act from Actionless Videos through Dense Correspondences". ☆191 · Updated 8 months ago
- ☆42 · Updated last month
- ☆56 · Updated 4 months ago
- Code for subgoal synthesis via image editing. ☆120 · Updated last year
- ☆86 · Updated 5 months ago
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization. ☆64 · Updated last month
- Official repo of VLABench, a large-scale benchmark designed for fairly evaluating VLAs, embodied agents, and VLMs. ☆86 · Updated last week
- ☆65 · Updated 3 months ago
- ☆69 · Updated 4 months ago
- [ICCV 2023] Official code repository for the ARNOLD benchmark. ☆149 · Updated 9 months ago
- [RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre… ☆84 · Updated 3 months ago
- Embodied Chain of Thought: A robotic policy that reasons to solve the task. ☆121 · Updated 4 months ago
- OpenVLA: An open-source vision-language-action model for robotic manipulation. ☆87 · Updated this week
- MOKA: Open-World Robotic Manipulation through Mark-based Visual Prompting (RSS 2024). ☆64 · Updated 6 months ago
- Latent Motion Token as the Bridging Language for Robot Manipulation. ☆65 · Updated last month
- [ICML 2024] The official implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning". ☆71 · Updated 3 months ago
- Official implementation of GR-MG. ☆66 · Updated this week
- LLaRA: Large Language and Robotics Assistant. ☆163 · Updated 3 months ago
- Official code of the paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution". ☆54 · Updated 2 months ago
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223 ☆96 · Updated last week
- NeurIPS 2022 paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation". ☆83 · Updated last year
- Repository for "General Flow as Foundation Affordance for Scalable Robot Learning". ☆45 · Updated 3 weeks ago
- Code for Reinforcement Learning from Vision Language Foundation Model Feedback. ☆65 · Updated 7 months ago
- PyTorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024). ☆113 · Updated 4 months ago
- RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning. ☆20 · Updated 3 months ago
- Code for the paper "Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation". ☆73 · Updated 5 months ago
- ☆59 · Updated 2 months ago
- ☆47 · Updated 3 weeks ago
- Codebase for HiP. ☆88 · Updated last year
- Official implementation of ReALFRED (ECCV'24). ☆31 · Updated 3 months ago