vimalabs/VIMA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/vimalabs/VIMA)

vimalabs / VIMA

Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"

☆845

Alternatives and similar repositories for VIMA

Users that are interested in VIMA are comparing it to the libraries listed below

Sorting:

vimalabs / VIMABench
View on GitHub
Official Task Suite Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
☆325Sep 26, 2023Updated 2 years ago
OpenGVLab / Instruct2Act
View on GitHub
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
☆373Jun 23, 2024Updated last year
peract / peract
View on GitHub
Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation
☆483May 9, 2024Updated last year
cliport / cliport
View on GitHub
CLIPort: What and Where Pathways for Robotic Manipulation
☆539Nov 2, 2023Updated 2 years ago
google-research / robotics_transformer
View on GitHub
☆1,678Jan 31, 2024Updated 2 years ago
facebookresearch / r3m
View on GitHub
Pre-training Reusable Representations for Robotic Manipulation Using Diverse Human Video Data
☆366Mar 21, 2023Updated 2 years ago
real-stanford / scalingup
View on GitHub
[CoRL 2023] This repository contains data generation and training code for Scaling Up & Distilling Down
☆405Aug 12, 2024Updated last year
huangwl18 / VoxPoser
View on GitHub
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models
☆784Feb 20, 2025Updated last year
mees / calvin
View on GitHub
CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
☆841Sep 8, 2025Updated 5 months ago
stepjam / RLBench
View on GitHub
A large-scale benchmark and learning environment.
☆1,702Jan 25, 2025Updated last year
GT-RIPL / Awesome-LLM-Robotics
View on GitHub
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
☆4,283Jan 27, 2026Updated last month
octo-models / octo
View on GitHub
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.
☆1,552Jul 31, 2024Updated last year
j96w / MimicPlay
View on GitHub
"MimicPlay: Long-Horizon Imitation Learning by Watching Human Play" code repository
☆306Apr 23, 2024Updated last year
robocasa / robocasa
View on GitHub
RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots
☆1,134Updated this week
facebookresearch / vip
View on GitHub
Official repository for "VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training"
☆180Oct 19, 2023Updated 2 years ago
simpler-env / SimplerEnv
View on GitHub
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…
☆980Dec 20, 2025Updated 2 months ago
ARISE-Initiative / robomimic
View on GitHub
robomimic: A Modular Framework for Robot Learning from Demonstration
☆1,309Feb 5, 2026Updated 3 weeks ago
google-research / ravens
View on GitHub
Train robotic agents to learn pick and place with deep learning for vision-based manipulation in PyBullet. Transporter Nets, CoRL 2020.
☆621Jul 30, 2024Updated last year
ir413 / mvp
View on GitHub
Masked Visual Pre-training for Robotics
☆245Apr 1, 2023Updated 2 years ago
NVlabs / mimicgen
View on GitHub
This code corresponds to simulation environments used as part of the MimicGen project.
☆548Aug 16, 2025Updated 6 months ago
Genesis-Embodied-AI / RoboGen
View on GitHub
A generative and self-guided robotic agent that endlessly propose and master new skills.
☆1,149May 31, 2024Updated last year
google-research / language-table
View on GitHub
Suite of human-collected datasets and a multi-task continuous control benchmark for open vocabulary visuolinguomotor learning.
☆351Feb 20, 2026Updated last week
real-stanford / diffusion_policy
View on GitHub
[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion
☆3,796Dec 24, 2024Updated last year
penn-pal-lab / LIV
View on GitHub
Official repository for "LIV: Language-Image Representations and Rewards for Robotic Control" (ICML 2023)
☆130Oct 19, 2023Updated 2 years ago
microsoft / PromptCraft-Robotics
View on GitHub
Community for applying LLMs to robotics and a robot simulator with ChatGPT integration
☆2,089Jan 20, 2024Updated 2 years ago
NVlabs / RVT
View on GitHub
Official Code for RVT-2 and RVT
☆398Feb 14, 2025Updated last year
lukashermann / hulc
View on GitHub
Hierarchical Universal Language Conditioned Policies
☆77Mar 19, 2024Updated last year
nickgkan / 3d_diffuser_actor
View on GitHub
Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"
☆384Aug 17, 2024Updated last year
haosulab / ManiSkill
View on GitHub
SAPIEN Manipulation Skill Framework, an open source GPU parallelized robotics simulator and benchmark, led by Hillbot, Inc.
☆2,595Jan 31, 2026Updated last month
eric-ai-lab / VLMbench
View on GitHub
NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"
☆98May 8, 2025Updated 9 months ago
flow-diffusion / AVDC
View on GitHub
Official repository of Learning to Act from Actionless Videos through Dense Correspondences.
☆248Apr 25, 2024Updated last year
liruiw / GenSim
View on GitHub
Generating Robotic Simulation Tasks via Large Language Models
☆347Mar 23, 2024Updated last year
facebookresearch / eai-vc
View on GitHub
The repository for the largest and most comprehensive empirical study of visual foundation models for Embodied AI (EAI).
☆499May 1, 2024Updated last year
google-deepmind / open_x_embodiment
View on GitHub
☆1,682Nov 5, 2025Updated 3 months ago
robopen / roboagent
View on GitHub
Repository to train and evaluate RoboAgent
☆360Apr 2, 2024Updated last year
RoboFlamingo / RoboFlamingo
View on GitHub
Code for RoboFlamingo
☆424May 8, 2024Updated last year
arnold-benchmark / arnold
View on GitHub
[ICCV 2023] ARNOLD: Language-Grounded Robot Manipulation with Continuous Object States in Realistic 3D Scenes
☆181Mar 16, 2025Updated 11 months ago
huangwl18 / ReKep
View on GitHub
ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation
☆911Feb 20, 2025Updated last year
siddk / voltron-robotics
View on GitHub
Voltron: Language-Driven Representation Learning for Robotics
☆234Jul 9, 2023Updated 2 years ago