eric-ai-lab/VLMbench

NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"

☆100

Alternatives and similar repositories for VLMbench

Users that are interested in VLMbench are comparing it to the libraries listed below.

suraj-nair-1 / lorel
☆38 · Mar 10, 2022 · Updated 4 years ago
eric-ai-lab / FedVLN
[ECCV 2022] Official pytorch implementation of the paper "FedVLN: Privacy-preserving Federated Vision-and-Language Navigation"
☆13 · Oct 8, 2022 · Updated 3 years ago
lukashermann / hulc
Hierarchical Universal Language Conditioned Policies
☆78 · Mar 19, 2024 · Updated 2 years ago
haoliuhl / instructrl
Instruction Following Agents with Multimodal Transformers
☆54 · Nov 3, 2022 · Updated 3 years ago
clvrai / skill-chaining
Adversarial Skill Chaining for Long-Horizon Robot Manipulation via Terminal State Regularization (CoRL 2021)
☆37 · May 3, 2022 · Updated 4 years ago
ErickRosete / tacorl
TACO-RL: Latent Plans for Task-Agnostic Offline Reinforcement Learning
☆32 · Jan 26, 2023 · Updated 3 years ago
stepjam / ARM
Q-attention (within the ARM system) and coarse-to-fine Q-attention (within C2F-ARM system).
☆192 · Feb 22, 2024 · Updated 2 years ago
mees / calvin
CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
☆914 · Sep 8, 2025 · Updated 8 months ago
haosulab / ManiSkill2-Learn
☆90 · May 23, 2024 · Updated last year
peract / peract
Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation
☆492 · May 9, 2024 · Updated 2 years ago
wliu88 / StructFormer
Pytorch code for ICRA 2022 Paper StructFormer
☆46 · Mar 15, 2022 · Updated 4 years ago
ChirikjianLab / ravens_visual_foresight
[IROS 2022] Transporters with Visual Foresight (TVF)
☆11 · Jul 25, 2022 · Updated 3 years ago
vimalabs / VIMABench
Official Task Suite Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
☆326 · Sep 26, 2023 · Updated 2 years ago
arnold-benchmark / arnold
[ICCV 2023] ARNOLD: Language-Grounded Robot Manipulation with Continuous Object States in Realistic 3D Scenes
☆184 · Mar 16, 2025 · Updated last year
stanford-iprl-lab / Concept2Robot
simulations used in "Concept2Robot: Learning Manipulation Concepts from Instructions and Human Demonstrations"
☆28 · Jan 1, 2023 · Updated 3 years ago
nickgkan / 3d_diffuser_actor
Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"
☆389 · Aug 17, 2024 · Updated last year
facebookresearch / r3m
Pre-training Reusable Representations for Robotic Manipulation Using Diverse Human Video Data
☆371 · Mar 21, 2023 · Updated 3 years ago
siddk / voltron-evaluation
Voltron Evaluation: Diverse Evaluation Tasks for Robotic Representation Learning
☆38 · Jul 9, 2023 · Updated 2 years ago
vlc-robot / hiveformer-corl
PyTorch implementation of the Hiveformer research paper
☆48 · Jun 27, 2023 · Updated 2 years ago
ademiadeniji / lamp
☆47 · Jan 29, 2024 · Updated 2 years ago
real-stanford / scalingup
[CoRL 2023] This repository contains data generation and training code for Scaling Up & Distilling Down
☆410 · Aug 12, 2024 · Updated last year
mees / hulc2
[ICRA2023] Grounding Language with Visual Affordances over Unstructured Data
☆48 · Oct 29, 2023 · Updated 2 years ago
seungyeon-k / Search-for-Grasp-public
[2023 CoRL] Leveraging 3D Reconstruction for Mechanical Search on Cluttered Shelves
☆11 · Dec 12, 2024 · Updated last year
prasoongoyal / PixL2R
☆17 · Dec 21, 2020 · Updated 5 years ago
real-stanford / umpnet
[RA-L / ICRA 2022] UMPNet: Universal Manipulation Policy Network for Articulated Objects
☆59 · Feb 16, 2022 · Updated 4 years ago
ayushjain1144 / ebmplanner
Code for the RSS 2023 paper "Energy-based Models are Zero-Shot Planners for Compositional Scene Rearrangement"
☆21 · Jul 4, 2023 · Updated 2 years ago
siddk / lila
Code & Experiments for "LILA: Language-Informed Latent Actions" to be presented at the Conference on Robot Learning (CoRL) 2021.
☆13 · Nov 4, 2021 · Updated 4 years ago
GR1-Manipulation / GR-1
Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"
☆45 · Apr 19, 2024 · Updated 2 years ago
rll-research / mosaic
Code for Paper "Towards More Generalizable One-Shot Visual Imitation Learning", ICRA 2022
☆20 · May 5, 2022 · Updated 4 years ago
wentaoyuan / sornet
Code for SORNet: Spatial Object-Centric Representations for Sequential Manipulation in CoRL 2021 (Best Systems Paper Finalist)
☆48 · Jun 24, 2022 · Updated 3 years ago
vlc-robot / hiveformer
☆33 · Sep 25, 2024 · Updated last year
google-research / clevr_robot_env
CLEVR-Robot: a reinforcement learning environment combining vision, language and control.
☆140 · Aug 4, 2024 · Updated last year
vimalabs / VIMA
Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
☆851 · Apr 18, 2024 · Updated 2 years ago
MohitShridhar / genima
Official Code Repo for GENIMA
☆77 · Oct 29, 2025 · Updated 6 months ago
zhouxian / act3d-chained-diffuser
A unified architecture for multimodal multi-task robotic policy learning.
☆179 · Feb 2, 2024 · Updated 2 years ago
michaelyuancb / general_flow
Repository for "General Flow as Foundation Affordance for Scalable Robot Learning"
☆70 · Dec 20, 2024 · Updated last year
siddk / voltron-robotics
Voltron: Language-Driven Representation Learning for Robotics
☆235 · Jul 9, 2023 · Updated 2 years ago
NVlabs / SceneCollisionNet
This repo contains the code for "Object Rearrangement Using Learned Implicit Collision Functions", an ICRA 2021 paper. For more informati…
☆61 · Jun 11, 2021 · Updated 4 years ago
younggyoseo / MWM
Masked World Models for Visual Control
☆136 · Jun 11, 2023 · Updated 2 years ago