Dantong88 / LLARVA
☆42Updated last month
Alternatives and similar repositories for LLARVA:
Users that are interested in LLARVA are comparing it to the libraries listed below
- Latent Motion Token as the Bridging Language for Robot Manipulation☆65Updated last month
- [RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre…☆84Updated 3 months ago
- Official implementation of GR-MG☆66Updated this week
- ☆47Updated 3 weeks ago
- LAPA: Latent Action Pretraining from Videos☆136Updated 3 weeks ago
- ☆86Updated 5 months ago
- Repository for "General Flow as Foundation Affordance for Scalable Robot Learning"☆45Updated 3 weeks ago
- ☆56Updated 4 months ago
- MOKA: Open-World Robotic Manipulation through Mark-based Visual Prompting (RSS 2024)☆64Updated 6 months ago
- Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution"☆54Updated 2 months ago
- Official repository of Learning to Act from Actionless Videos through Dense Correspondences.☆191Updated 8 months ago
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization☆64Updated last month
- RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning☆20Updated 3 months ago
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆96Updated last week
- Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)☆42Updated 6 months ago
- ☆59Updated 2 months ago
- ☆65Updated 3 months ago
- Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World☆124Updated 2 months ago
- Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.☆86Updated last week
- ☆27Updated 3 months ago
- ☆21Updated 6 months ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆43Updated 8 months ago
- Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy."☆45Updated last month
- Reimplementation of GR-1, a generalized policy for robotics manipulation.☆113Updated 4 months ago
- An official code repository for the paper "Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation"☆57Updated 2 weeks ago
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data☆38Updated last year
- The official codebase for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation(cvpr 2024)☆102Updated 6 months ago
- [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model`☆102Updated 3 months ago
- [ECCV 2024] 🎉 Official repository of "Robo-ABC: Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipu…☆64Updated last month
- ManiCM: Real-time 3D Diffusion Policy via Consistency Model for Robotic Manipulation☆89Updated 6 months ago