NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"
☆98May 8, 2025Updated 9 months ago
Alternatives and similar repositories for VLMbench
Users that are interested in VLMbench are comparing it to the libraries listed below
Sorting:
- ☆38Mar 10, 2022Updated 3 years ago
- Hierarchical Universal Language Conditioned Policies☆77Mar 19, 2024Updated last year
- Adversarial Skill Chaining for Long-Horizon Robot Manipulation via Terminal State Regularization (CoRL 2021)☆36May 3, 2022Updated 3 years ago
- Instruction Following Agents with Multimodal Transforemrs☆53Nov 3, 2022Updated 3 years ago
- TACO-RL: Latent Plans for Task-Agnostic Offline Reinforcement Learning☆30Jan 26, 2023Updated 3 years ago
- CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks☆841Sep 8, 2025Updated 5 months ago
- [ECCV 2022] Official pytorch implementation of the paper "FedVLN: Privacy-preserving Federated Vision-and-Language Navigation"☆13Oct 8, 2022Updated 3 years ago
- Q-attention (within the ARM system) and coarse-to-fine Q-attention (within C2F-ARM system).☆191Feb 22, 2024Updated 2 years ago
- simulations used in "Concept2Robot: Learning Manipulation Concepts from Instructions and Human Demonstrations"☆28Jan 1, 2023Updated 3 years ago
- [2023 CoRL] Leveraging 3D Reconstruction for Mechanical Search on Cluttered Shelves☆11Dec 12, 2024Updated last year
- [IROS 2022] Transporters with Visual Foresight (TVF)☆11Jul 25, 2022Updated 3 years ago
- Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation☆483May 9, 2024Updated last year
- PyTorch implementation of the Hiveformer research paper☆48Jun 27, 2023Updated 2 years ago
- Pre-training Reusable Representations for Robotic Manipulation Using Diverse Human Video Data☆366Mar 21, 2023Updated 2 years ago
- [RA-L / ICRA 2022] UMPNet: Universal Manipulation Policy Network for Articulated Objects☆59Feb 16, 2022Updated 4 years ago
- Pytorch code for ICRA 2022 Paper StructFormer☆46Mar 15, 2022Updated 3 years ago
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data☆45Oct 29, 2023Updated 2 years ago
- [ICCV 2023] ARNOLD: Language-Grounded Robot Manipulation with Continuous Object States in Realistic 3D Scenes☆181Mar 16, 2025Updated 11 months ago
- ☆17Dec 21, 2020Updated 5 years ago
- ☆46Jan 29, 2024Updated 2 years ago
- Code for SORNet: Spatial Object-Centric Representations for Sequential Manipulation in CoRL 2021 (Best Systems Paper Finalist)☆47Jun 24, 2022Updated 3 years ago
- ☆90May 23, 2024Updated last year
- Voltron Evaluation: Diverse Evaluation Tasks for Robotic Representation Learning☆37Jul 9, 2023Updated 2 years ago
- Voltron: Language-Driven Representation Learning for Robotics☆234Jul 9, 2023Updated 2 years ago
- Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"☆384Aug 17, 2024Updated last year
- Official Task Suite Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"☆325Sep 26, 2023Updated 2 years ago
- Code for Paper "Towards More Generalizable One-Shot Visual Imitation Learning", ICRA 2022☆20May 5, 2022Updated 3 years ago
- [CoRL 2023] This repository contains data generation and training code for Scaling Up & Distilling Down☆405Aug 12, 2024Updated last year
- CLEVR-Robot: a reinforcement learning environment combining vision, language and control.☆138Aug 4, 2024Updated last year
- This repo contains the code for "Object Rearrangement Using Learned Implicit Collision Functions", an ICRA 2021 paper. For more informati…☆60Jun 11, 2021Updated 4 years ago
- Masked World Models for Visual Control☆135Jun 11, 2023Updated 2 years ago
- Code & Experiments for "LILA: Language-Informed Latent Actions" to be presented at the Conference on Robot Learning (CoRL) 2021.☆14Nov 4, 2021Updated 4 years ago
- Repository for "General Flow as Foundation Affordance for Scalable Robot Learning"☆70Dec 20, 2024Updated last year
- Official Code Repo for GENIMA☆77Oct 29, 2025Updated 4 months ago
- Chain-of-Thought Predictive Control☆57May 1, 2023Updated 2 years ago
- Code for the RSS 2023 paper "Energy-based Models are Zero-Shot Planners for Compositional Scene Rearrangement"☆21Jul 4, 2023Updated 2 years ago
- A unified architecture for multimodal multi-task robotic policy learning.☆176Feb 2, 2024Updated 2 years ago
- ☆15Aug 9, 2021Updated 4 years ago
- ☆33Sep 25, 2024Updated last year