aopolin-lv / RoboMP2Links

[ICML 2024] RoboMP2: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language Models

☆12

Alternatives and similar repositories for RoboMP2

Users that are interested in RoboMP2 are comparing it to the libraries listed below

Sorting:

moojink / rlds_dataset_builder
An example RLDS dataset builder for X-embodiment dataset conversion.
☆50Updated 8 months ago
real-stanford / reflect
[CoRL 2023] REFLECT: Summarizing Robot Experiences for Failure Explanation and Correction
☆101Updated last year
Nicolinho / RoboVLM
☆47Updated last year
Zxy-MLlab / LIBERO-PRO
LIBERO-PRO is the official repository of the LIBERO-PRO — an evaluation extension of the original LIBERO benchmark
☆105Updated 2 weeks ago
aiming-lab / GRAPE
GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization
☆151Updated 7 months ago
bytedance / GR-MG
Official implementation of GR-MG
☆92Updated 10 months ago
clorislili / ManipLLM
The official codebase for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation(cvpr 2024)
☆143Updated last year
sled-group / RACER
[ICRA 2025] RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning
☆38Updated last year
SiyuanHuang95 / ManipVQA
[IROS24 Oral]ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
☆98Updated last year
vlc-robot / hiveformer
☆33Updated last year
whichwhichgone / VLAS
☆39Updated 4 months ago
yufeiwang63 / RL-VLM-F
Code for Reinforcement Learning from Vision Language Foundation Model Feedback
☆126Updated last year
notFoundThisPerson / RoboCAS-v0
☆29Updated last year
markusgrotz / peract_bimanual
Code for PerAct², a language-conditioned imitation learning agent designed for bimanual robotic manipulation using the RLBench environmen…
☆107Updated 9 months ago
kpertsch / rlds_dataset_mod
Efficiently apply modification functions to RLDS/TFDS datasets.
☆36Updated last year
Fanqi-Lin / OneTwoVLA
Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"
☆199Updated 6 months ago
pipixiaqishi1 / SAM-E
☆47Updated last year
AlbertTan404 / pytorch-open-x-embodiment
Data pre-processing and training code on Open-X-Embodiment with pytorch
☆11Updated 10 months ago
snumprlab / capeam
Official Implementation of CAPEAM (ICCV'23)
☆14Updated last year
hairuoliu1 / ICLR-2025-Robotics
A list of robotics related papers accepted by ICLR'25
☆24Updated 3 months ago
EDiRobotics / mimictest
A simple testbed for robotics manipulation policies
☆103Updated 7 months ago
moka-manipulation / moka
MOKA: Open-World Robotic Manipulation through Mark-based Visual Prompting (RSS 2024)
☆90Updated last year
EDiRobotics / GR1-Training
Reimplementation of GR-1, a generalized policy for robotics manipulation.
☆144Updated last year
abliao / PIVOT-R
[NeurIPS 2024] PIVOT-R: Primitive-Driven Waypoint-Aware World Model for Robotic Manipulation
☆46Updated last year
AlbertTan404 / RoLD
[MMM 2025 Best Paper] RoLD: Robot Latent Diffusion for Multi-Task Policy Modeling
☆22Updated last year
Max-Fu / otter
[ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction
☆110Updated 7 months ago
Dantong88 / LLARVA
☆60Updated 11 months ago
InternRobotics / Seer
[ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation
☆262Updated 4 months ago
DeepTimber-Robot-Lab / Paper-Reading-List
☆34Updated last year
Stanford-ILIAD / droc
Public release for "Distillation and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections"
☆46Updated last year