OpenHelix-Team / Unified-Diffusion-VLALinks
π₯ The first open-sourced diffusion vision-langauge-action model.
β149Updated last week
Alternatives and similar repositories for Unified-Diffusion-VLA
Users that are interested in Unified-Diffusion-VLA are comparing it to the libraries listed below
Sorting:
- Official code of Motus: A Unified Latent Action World Modelβ423Updated last week
- Official implementation for "HA-VLN 2.0: An Open Benchmark and Leaderboard for Human-Aware Navigation in Discrete and Continuous Environmβ¦β377Updated 3 weeks ago
- GigaBrain-0: A World Model-Powered Vision-Language-Action Modelβ1,243Updated last month
- GigaWorld-0: World Models as Data Engine to Empower Embodied AIβ933Updated last month
- π WorldLens: Full-Spectrum Evaluations of Driving World Models in Real Worldβ166Updated 2 weeks ago
- π Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systemsβ120Updated this week
- [NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3Dβ197Updated last week
- RealMirror, a comprehensive, open-source embodied AI VLA platform.β265Updated 3 weeks ago
- Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation"β263Updated 2 months ago
- [TRO 2024] Grasp, See and Place: Efficient Unknown Object Rearrangement with Policy Structure Priorβ72Updated 9 months ago
- β127Updated last month
- [ICRA 2025] PUGS: Zero-shot Physical Understanding with Gaussian Splatting.β102Updated 9 months ago
- β303Updated 2 months ago
- Awesome collection of resources and papers on Diffusion Models for Robotic Manipulation.β748Updated 4 months ago
- DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Modelsβ466Updated 3 weeks ago
- Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Viewsβ172Updated 3 weeks ago
- β94Updated 5 months ago
- [NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understandingβ347Updated 2 weeks ago
- [AAAI 2026 Oral] LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequencesβ179Updated 3 weeks ago
- [NeurIPS 25] TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixelsβ165Updated last week
- VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Modelβ1,875Updated last month
- β545Updated 2 months ago
- β35Updated last year
- [CoRL2024] Let Occ Flow: Self-Supervised 3D Occupancy Flow Predictionβ128Updated 2 months ago
- β247Updated 11 months ago
- π₯ [AAAI 2026 Oral] Official code for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptatβ¦β72Updated last year
- OmniNWM: Omniscient Navigation World Models for Autonomous Drivingβ265Updated 2 months ago
- A Unified Driving World Model for Future Generation and Perceptionβ132Updated 5 months ago
- SCFlow2: Plug-and-Play Object Pose Refiner with Shape-Constraint Scene Flow, CVPR2025β54Updated 4 months ago
- Autonomous Implicit Indoor Scene Reconstruction with Frontier Exploration οΌICRA 2024οΌβ50Updated last year