OpenHelix-Team / Unified-Diffusion-VLALinks
π₯ The first open-sourced diffusion vision-langauge-action model.
β160Updated last month
Alternatives and similar repositories for Unified-Diffusion-VLA
Users that are interested in Unified-Diffusion-VLA are comparing it to the libraries listed below
Sorting:
- Official implementation for "HA-VLN 2.0: An Open Benchmark and Leaderboard for Human-Aware Navigation in Discrete and Continuous Environmβ¦β380Updated last month
- Official code of Motus: A Unified Latent Action World Modelβ740Updated last month
- GigaBrain-0: A World Model-Powered Vision-Language-Action Modelβ2,236Updated last week
- GigaWorld-0: World Models as Data Engine to Empower Embodied AIβ1,471Updated 2 months ago
- [NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3Dβ203Updated last month
- π WorldLens: Full-Spectrum Evaluations of Driving World Models in Real Worldβ178Updated 3 weeks ago
- π Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systemsβ135Updated last week
- Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation"β270Updated 3 months ago
- DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Modelsβ482Updated 3 weeks ago
- Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Viewsβ185Updated 2 months ago
- [TRO 2024] Grasp, See and Place: Efficient Unknown Object Rearrangement with Policy Structure Priorβ73Updated 10 months ago
- Awesome collection of resources and papers on Diffusion Models for Robotic Manipulation.β759Updated 5 months ago
- WAM-Diff: A Masked Diffusion VLA Framework with MoE and Online Reinforcement Learning for Autonomous Drivingβ166Updated last week
- [ICRA 2025] PUGS: Zero-shot Physical Understanding with Gaussian Splatting.β104Updated 10 months ago
- β129Updated 2 months ago
- β545Updated 3 months ago
- Official implementation of [AstraNav-World: World Model for Foresight Control and Consistency]β66Updated 3 weeks ago
- [AAAI 2026 Oral] LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequencesβ184Updated 2 months ago
- β93Updated 7 months ago
- [CoRL2024] Let Occ Flow: Self-Supervised 3D Occupancy Flow Predictionβ130Updated 4 months ago
- β37Updated last year
- [Official] AstraNav-Memory: Contexts Compression for Long Memory. An image-centric memory framework for lifelong embodied navigation via β¦β29Updated 3 weeks ago
- β324Updated 3 months ago
- The accepted paper for cvpr2025.β55Updated 2 months ago
- [ICRA 2026] A Unified Driving World Model for Future Generation and Perceptionβ136Updated last week
- Autonomous Implicit Indoor Scene Reconstruction with Frontier Exploration οΌICRA 2024οΌβ50Updated last year
- [ICRA2024] The official implementation of Robot Trajectronβ111Updated last month
- Embodied Co-Design for Rapidly Evolving Agents: Taxonomy, Frontiers, and Challengesβ296Updated last week
- VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Modelβ1,971Updated 2 months ago
- β246Updated last year