EO-Robotics / EO1Links
EO: Open-source Unified Embodied Foundation Model Series
☆280Updated last month
Alternatives and similar repositories for EO1
Users that are interested in EO1 are comparing it to the libraries listed below
Sorting:
- GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models☆472Updated 3 months ago
- [NeurIPS 2025 spotlight] Official implementation for "FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving…☆551Updated 3 months ago
- This is a project about visual spatial reasoning.☆82Updated last month
- Official implementation for "JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation"☆334Updated 2 months ago
- (Preprint) ORV: 4D Occupancy-centric Robot Video Generation.☆72Updated last month
- Official repo of paper "SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models". A post-training framework that creates a cost-e…☆88Updated last month
- [2025CVPR] FlowRAM: Grounding Flow Matching Policy with Region-Aware Mamba Framework for Robotic Manipulation☆45Updated last month
- Official code release for paper "Robo-Imagine: A Robotic Video Generation Model, For Autoregressive Long-Term Task Video Generation With …☆28Updated 5 months ago
- [NeurIPS`25] TC-Light: Temporally Coherent Generative Rendering for Realistic World Transfer☆99Updated last month
- 🦾 A Dual-System VLA with System2 Thinking☆128Updated 4 months ago
- ☆22Updated 8 months ago
- 🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.☆617Updated 6 months ago
- A curated list of papers on reinforcement learning for video generation☆281Updated last month
- [ICCV 2025] Official implementation of the paper “MagicDrive-V2: High-Resolution Long Video Generation for Autonomous Driving with Adapti…☆677Updated 6 months ago
- [ICCV 2025] HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation☆210Updated 5 months ago
- InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy☆323Updated 2 weeks ago
- HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model☆333Updated 3 months ago
- [AAAI2026 Oral] Official implementation of "StyleDrive: Towards Driving-Style Aware Benchmarking of End-To-End Autonomous Driving"☆109Updated last month
- [CVPR 2025] Neural Motion Simulator Pushing the Limit of World Models in Reinforcement Learning☆45Updated 6 months ago
- RynnVLA-002: A Unified Vision-Language-Action and World Model☆818Updated last month
- [NeurIPS 2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge☆265Updated 3 months ago
- RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation☆275Updated last month
- Official implementation of our NeurIPS 2025 paper: "FlyLoRA: Boosting Task Decoupling and Parameter Efficiency via Implicit Rank-Wise Mix…☆171Updated last month
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos☆431Updated 11 months ago
- EO: Open-source Unified Embodied Foundation Model Series☆33Updated 2 weeks ago
- LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [Actively Maintained🔥]☆173Updated 2 months ago
- Official implementation of "Diff4Splat: Controllable 4D Scene Generation with Latent Dynamic Reconstruction Models".☆60Updated 3 weeks ago
- ☆31Updated 4 months ago
- VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model☆343Updated 8 months ago
- The offical Implementation of "Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model"☆404Updated last week