MiMo-Embodied
☆358Nov 21, 2025Updated 3 months ago
Alternatives and similar repositories for MiMo-Embodied
Users that are interested in MiMo-Embodied are comparing it to the libraries listed below
Sorting:
- [ICML 2025] The Official Implementation of "Efficient Robotic Policy Learning via Latent Space Backward Planning"☆30Dec 15, 2025Updated 2 months ago
- ☆104Dec 4, 2025Updated 3 months ago
- StreamDiffusion, Live Stream APP☆364Updated this week
- ☆26Mar 11, 2025Updated 11 months ago
- [AAAI 2026] WorldRFT: Latent World Model Planning with Reinforcement Fine-Tuning for Autonomous Driving☆30Dec 23, 2025Updated 2 months ago
- (ICLR2025) Enhancing End-to-End Autonomous Driving with Latent World Model☆318Jun 29, 2025Updated 8 months ago
- Cosmos Policy☆555Jan 23, 2026Updated last month
- [CoRL '25] Pseudo-Simulation for Autonomous Driving; [NeurIPS '24] NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Ben…☆890Oct 27, 2025Updated 4 months ago
- [NeurIPS'25] SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning☆41Oct 14, 2025Updated 4 months ago
- The official implementation of Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight☆83Jan 16, 2026Updated last month
- [ACM MM25] Official Pytorch implementation of [Decoupled Global-Local Alignment for Improving Compositional Understanding]☆15Jul 15, 2025Updated 7 months ago
- ☆26Jul 29, 2025Updated 7 months ago
- The official implementation of Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion [AAAI'2…☆15Feb 2, 2026Updated last month
- ☆11Mar 13, 2023Updated 2 years ago
- ☆96Jan 3, 2026Updated 2 months ago
- A Large-scale Video Action Dataset☆417Jan 16, 2026Updated last month
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…☆214Oct 12, 2025Updated 4 months ago
- Dream-VL and Dream-VLA, a diffusion VLM and a diffusion VLA.☆108Jan 14, 2026Updated last month
- Generate slides using GPT models☆10Feb 24, 2023Updated 3 years ago
- ☆38Oct 16, 2025Updated 4 months ago
- [CVPR 2026] A research framework for autonomous driving in CARLA, features TransFuser v6. Accompanies the paper "LEAD: Minimizing Learner…☆122Feb 25, 2026Updated last week
- Advancing the frontier of efficient AI☆54Updated this week
- ☆19Jun 4, 2025Updated 9 months ago
- CoV: Chain-of-View Prompting for Spatial Reasoning☆51Jan 23, 2026Updated last month
- UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation☆18Aug 12, 2025Updated 6 months ago
- Submission template for Tiny Tapeout SKY130 (ChipFoundry) shuttles - Verilog HDL Projects☆31Feb 27, 2026Updated last week
- [NeurIPS-2024] The offical Implementation of "Instruction-Guided Visual Masking"☆42Nov 15, 2024Updated last year
- [ICRA'25] H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps☆12Apr 10, 2025Updated 10 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Dec 19, 2024Updated last year
- The official repository for the paper "Real-world Reinforcement Learning from Suboptimal Interventions”.☆38Updated this week
- Cosmos-Predict1 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world m…☆414Jan 6, 2026Updated 2 months ago
- Cambrian-S: Towards Spatial Supersensing in Video☆500Dec 27, 2025Updated 2 months ago
- ☆37Jan 18, 2023Updated 3 years ago
- Extending context length of visual language models☆12Dec 18, 2024Updated last year
- [SIGGRAPH Asia 2025] CHARM: Control-point-based 3D Anime Hairstyle Auto-Regressive Modeling☆44Sep 26, 2025Updated 5 months ago
- ☆18Mar 19, 2025Updated 11 months ago
- [CVPR 2026] DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning☆83Feb 21, 2026Updated last week
- ☆26Sep 26, 2025Updated 5 months ago
- Official implementation of Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts(IJCAI 2024)☆15Oct 16, 2024Updated last year