UMass-Embodied-AGI / MultiPLYLinks

Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World

☆134

Alternatives and similar repositories for MultiPLY

Users that are interested in MultiPLY are comparing it to the libraries listed below

Sorting:

declare-lab / Emma-X
Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning
☆78Updated 6 months ago
Dantong88 / LLARVA
☆60Updated 11 months ago
liufanfanlff / RoboUniview
☆61Updated 9 months ago
SilongYong / SQA3D
[ICLR 2023] SQA3D for embodied scene understanding and reasoning
☆152Updated 2 years ago
changhaonan / A3VLM
[CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model`
☆121Updated last year
CladernyJorn / UP-VLA
Official PyTorch implementation for ICML 2025 paper: UP-VLA.
☆51Updated 5 months ago
rainbow979 / robodreamer
☆87Updated last year
thunlp / EmbodiedEval
Evaluate Multimodal LLMs as Embodied Agents
☆54Updated 9 months ago
pickxiguapi / Embodied-R1
Official code for "Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation"
☆108Updated 3 months ago
OpenDriveLab / CLOVER
[NeurIPS 2024] CLOVER: Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation
☆130Updated 3 months ago
LostXine / LLaRA
[ICLR'25] LLaRA: Supercharging Robot Learning Data for Vision-Language Policy
☆225Updated 8 months ago
Gabesarch / HELPER
☆32Updated last year
allenai / spoc-robot-training
SPOC: Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World
☆142Updated last year
sg-3d / sg3d
☆52Updated last year
GR1-Manipulation / GR-1
Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"
☆44Updated last year
3dlg-hcvc / hssd
Code repository for the Habitat Synthetic Scenes Dataset (HSSD) paper.
☆106Updated last year
InternRobotics / F1-VLA
F1: A Vision Language Action Model Bridging Understanding and Generation to Actions
☆137Updated last month
Gary3410 / TaPA
[arXiv 2023] Embodied Task Planning with Large Language Models
☆193Updated 2 years ago
OpenDriveLab / MPI
[RSS 2024] Learning Manipulation by Predicting Interaction
☆116Updated 5 months ago
Zhoues / RoboRefer
[NeurIPS 2025] Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"
☆205Updated last month
SiyuanHuang95 / ManipVQA
[IROS24 Oral]ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
☆98Updated last year
MSR3D / MSR3D
[NeurIPS 2024] Official code repository for MSR3D paper
☆68Updated last week
aiming-lab / GRAPE
GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization
☆152Updated 8 months ago
BeingBeyond / Being-H0
Being-H0: Vision-Language-Action Pretraining from Large-Scale Human Videos
☆182Updated 3 months ago
lmzpai / roboMamba
The repo of paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation`
☆143Updated 11 months ago
InternRobotics / InstructVLA
InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation
☆70Updated 2 months ago
google-deepmind / robovqa
☆33Updated last year
InternRobotics / Grounded_3D-LLM
Code&Data for Grounded 3D-LLM with Referent Tokens
☆130Updated 11 months ago
BAAI-DCAI / SpatialBot
The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.
☆319Updated 2 months ago
InternRobotics / InternVLA-M1
InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy
☆296Updated 3 weeks ago