tulerfeng / Awesome-Embodied-Multimodal-LLMs
Latest Advances on Embodied Multimodal LLMs (or Vision-Language-Action Models).
☆53 · Updated 4 months ago
Related projects
Alternatives and complementary repositories for Awesome-Embodied-Multimodal-LLMs
- [CVPR 2024] On the Road to Portability: Compressing End-to-End Motion Planner for Autonomous Driving ☆128 · Updated 7 months ago
- Official PyTorch implementation of CODA-LM (https://arxiv.org/abs/2404.10595) ☆68 · Updated 3 weeks ago
- Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models ☆75 · Updated 2 months ago
- [CVPR 2024 Highlight] The official repo for the paper "Abductive Ego-View Accident Video Understanding for Safe Driving Perception" ☆30 · Updated last month
- A paper list of recent works on token compression for ViT and VLM ☆149 · Updated this week
- Official implementation of the paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference" proposed by Pekin… ☆56 · Updated last month
- Automatically updates arXiv papers about SOT & VLT, Multi-modal Learning, LLMs, and Video Understanding using GitHub Actions. ☆19 · Updated this week
- Official implementation of the paper "Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal …" ☆27 · Updated this week
- [NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models?" ☆148 · Updated last month
- An RLHF Infrastructure for Vision-Language Models ☆111 · Updated last week
- VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models ☆26 · Updated 4 months ago
- ✨✨ MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans? ☆79 · Updated this week
- [AAAI 2024] NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario. ☆160 · Updated 3 weeks ago
- [NeurIPS'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought … ☆139 · Updated last month
- A Multi-Modal Large Language Model with Retrieval-augmented In-context Learning capacity designed for generalisable and explainable end-t… ☆75 · Updated last month
- Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving ☆72 · Updated 10 months ago
- [ECCV 2024] The official code for "Dolphins: Multimodal Language Model for Driving" ☆49 · Updated 4 months ago
- The official code of the paper "PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction". ☆45 · Updated 3 weeks ago
- A Survey on Benchmarks of Multimodal Large Language Models ☆66 · Updated last month
- A paper collection on autoregressive models in vision. ☆233 · Updated this week
- AL-Ref-SAM 2: Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segm… ☆69 · Updated last month
- [ECCV 2024] Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs ☆71 · Updated 2 weeks ago
- [NeurIPS 2024] Repo for the paper "ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models" ☆99 · Updated last week
- VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation ☆144 · Updated last month
- A systematic survey of multi-modal and multi-task visual understanding foundation models for driving scenarios ☆47 · Updated 5 months ago
- [CVPR 2024] The official implementation of MP5 ☆84 · Updated 4 months ago
- [NeurIPS 2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model ☆83 · Updated 11 months ago