XiaomiMiMo / MiMo-Embodied
☆342 · Updated last month
Alternatives and similar repositories for MiMo-Embodied
Users interested in MiMo-Embodied are comparing it to the repositories listed below.
- Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks · ☆185 · Updated 3 months ago
- RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation · ☆275 · Updated last month
- Unified Vision-Language-Action Model · ☆257 · Updated 2 months ago
- LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [Actively Maintained🔥] · ☆173 · Updated 2 months ago
- Official repository for "RLVR-World: Training World Models with Reinforcement Learning" (NeurIPS 2025), https://arxiv.org/abs/2505.13934 · ☆181 · Updated 2 months ago
- NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks · ☆200 · Updated last month
- ☆346 · Updated last week
- Latest Advances on Embodied Multimodal LLMs (or Vision-Language-Action Models) · ☆122 · Updated last year
- RynnVLA-002: A Unified Vision-Language-Action and World Model · ☆818 · Updated last month
- ☆60 · Updated last month
- Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models · ☆164 · Updated 3 months ago
- InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy · ☆330 · Updated last week
- F1: A Vision Language Action Model Bridging Understanding and Generation to Actions · ☆153 · Updated last week
- Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces · ☆87 · Updated 7 months ago
- ☆102 · Updated 2 months ago
- [NeurIPS 2025] Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics" · ☆218 · Updated 3 weeks ago
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models." · ☆326 · Updated 3 months ago
- VLA-RFT: Vision-Language-Action Models with Reinforcement Fine-Tuning · ☆113 · Updated 3 months ago
- ☆60 · Updated 9 months ago
- [NeurIPS 2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge · ☆273 · Updated this week
- Official code for EWMBench: Evaluating Scene, Motion, and Semantic Quality in Embodied World Models · ☆95 · Updated 6 months ago
- EO: Open-source Unified Embodied Foundation Model Series · ☆34 · Updated 2 weeks ago
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, … · ☆198 · Updated 8 months ago
- Virtual Community: An Open World for Humans, Robots, and Society · ☆178 · Updated last week
- Official implementation for BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation · ☆100 · Updated 5 months ago
- [ICML'25] The PyTorch implementation of the paper "AdaWorld: Learning Adaptable World Models with Latent Actions" · ☆190 · Updated 6 months ago
- Galaxea's first VLA release · ☆471 · Updated this week
- InternVLA-A1: Unifying Understanding, Generation, and Action for Robotic Manipulation · ☆226 · Updated this week
- Nav-R1: Reasoning and Navigation in Embodied Scenes · ☆96 · Updated 2 months ago
- Cosmos-RL is a flexible and scalable Reinforcement Learning framework specialized for Physical AI applications. · ☆274 · Updated this week