google-deepmind / robovqa
☆21Updated last year
Alternatives and similar repositories for robovqa:
Users that are interested in robovqa are comparing it to the libraries listed below
- ☆64Updated 6 months ago
- [ICRA 2025] RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning☆26Updated 5 months ago
- NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"☆90Updated 2 years ago
- Repository for "General Flow as Foundation Affordance for Scalable Robot Learning"☆46Updated 2 months ago
- ☆93Updated 6 months ago
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data☆41Updated last year
- ☆25Updated this week
- Codebase for HiP☆88Updated last year
- ☆35Updated 10 months ago
- ☆43Updated last year
- VP2 Benchmark (A Control-Centric Benchmark for Video Prediction, ICLR 2023)☆27Updated last week
- ☆62Updated 4 months ago
- ☆44Updated 2 months ago
- ☆66Updated 4 months ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆45Updated 3 months ago
- ☆73Updated 6 months ago
- MiniGrid Implementation of BEHAVIOR Tasks☆40Updated 7 months ago
- ☆37Updated 6 months ago
- ☆45Updated last year
- Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)☆43Updated 8 months ago
- Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"☆28Updated 10 months ago
- [EMNLP 2023 (Findings)] This repository contains data processing, evaluation, and fine-tuning code for NEWTON: Are Large Language Models …☆33Updated 4 months ago
- Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning☆48Updated last month
- MOKA: Open-World Robotic Manipulation through Mark-based Visual Prompting (RSS 2024)☆71Updated 7 months ago
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration".☆42Updated 2 months ago
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization☆90Updated last month
- Latent Motion Token as the Bridging Language for Robot Manipulation☆74Updated last month
- The official codebase for running the experiments described in the AVDC paper.☆16Updated 5 months ago