thu-ml / MotusLinks
Official code of Motus: A Unified Latent Action World Model
β597Updated 3 weeks ago
Alternatives and similar repositories for Motus
Users that are interested in Motus are comparing it to the libraries listed below
Sorting:
- π₯ The first open-sourced diffusion vision-langauge-action model.β153Updated 2 weeks ago
- GigaWorld-0: World Models as Data Engine to Empower Embodied AIβ1,226Updated last month
- GigaBrain-0: A World Model-Powered Vision-Language-Action Modelβ1,853Updated 2 months ago
- Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Viewsβ181Updated last month
- VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Modelβ1,935Updated 2 months ago
- π WorldLens: Full-Spectrum Evaluations of Driving World Models in Real Worldβ174Updated last week
- Official implementation for "HA-VLN 2.0: An Open Benchmark and Leaderboard for Human-Aware Navigation in Discrete and Continuous Environmβ¦β378Updated last month
- β128Updated 2 months ago
- Awesome collection of resources and papers on Diffusion Models for Robotic Manipulation.β754Updated 4 months ago
- RealMirror, a comprehensive, open-source embodied AI VLA platform.β427Updated 3 weeks ago
- The accepted paper for cvpr2025.β50Updated last month
- 4DNeX: Feed-Forward 4D Generative Modeling Made Easyβ818Updated last month
- DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Modelsβ475Updated last week
- [ICRA 2025] PUGS: Zero-shot Physical Understanding with Gaussian Splatting.β104Updated 10 months ago
- [NeurIPS 25] TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixelsβ174Updated last month
- β141Updated 10 months ago
- [NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3Dβ200Updated last month
- [TRO 2024] Grasp, See and Place: Efficient Unknown Object Rearrangement with Policy Structure Priorβ72Updated 10 months ago
- β314Updated 3 months ago
- A Unified Driving World Model for Future Generation and Perceptionβ134Updated 6 months ago
- β544Updated 2 months ago
- Reflective Planning: Vision-Language Models for Multi-Stage Long-Horizon Robotic Manipulationβ166Updated 6 months ago
- Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation"β268Updated 2 months ago
- [AAAI 2026 π₯] Official implementation of "NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representation"β176Updated 5 months ago
- A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.β700Updated last week
- β31Updated 9 months ago
- RynnEC: Bringing MLLMs into Embodied Worldβ383Updated 2 months ago
- The code of paper "LaMP: Language-Motion Pretraining for Motion Generation, Retrieval, and Captioning" accepted by ICLR'25β147Updated 2 months ago
- This repository contains the code of the paper "IC-World: In-Context Generation for Shared World Modeling".β122Updated 2 weeks ago
- π Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systemsβ131Updated 2 weeks ago