OpenHelix-Team / Unified-Diffusion-VLALinks
🔥 The first open-sourced diffusion vision-langauge-action model.
☆57Updated this week
Alternatives and similar repositories for Unified-Diffusion-VLA
Users that are interested in Unified-Diffusion-VLA are comparing it to the libraries listed below
Sorting:
- Official implementation of T-PAMI25 paper "M²Diffuser: Diffusion-based Trajectory Optimization for Mobile Manipulation in 3D Scenes"☆96Updated 5 months ago
- code for CoRL2025 "LaDiWM: A Latent Diffusion-based World Model for Predictive Manipulation"☆37Updated this week
- [CVPR 2025] RoomTour3D - Geometry-aware, cheap and automatic data from web videos for embodied navigation☆66Updated 8 months ago
- ☆90Updated 10 months ago
- This is the source code to paper “DAgger Diffusion Navigation: DAgger Boosted Diffusion Policy for Vision-Language Navigation”.☆24Updated 3 months ago
- Official implementation of ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver.☆67Updated last week
- ☆45Updated 3 months ago
- Official implementation of "g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks" (CVPR'25).☆43Updated 4 months ago
- This repository is the official implementation of our paper (From reactive to cognitive: brain-inspired spatial intelligence for embodied…☆63Updated 2 weeks ago
- Official Code for "From Cognition to Precognition: A Future-Aware Framework for Social Navigation" (ICRA 2025)☆90Updated last month
- Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24).☆73Updated 8 months ago
- ☆49Updated 2 weeks ago
- Nav-R1: Reasoning and Navigation in Embodied Scenes☆71Updated 3 weeks ago
- AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation☆29Updated 4 months ago
- ☆20Updated 3 weeks ago
- ✨✨【NeurIPS 2025】Official implementation of BridgeVLA☆155Updated 2 months ago
- N2M: Bridging Navigation and Manipulation by Learning Initial Pose Preference from Rollout☆24Updated this week
- Evo-0: Vision-Language-Action Model with Implicit Spatial Understanding.☆45Updated this week
- [CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning☆51Updated 7 months ago
- [ICRA 2025] Official implementation of Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-S…☆86Updated 5 months ago
- ☆50Updated last month
- [RA-L 2025] Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation☆120Updated 7 months ago
- [AAAI 25] The official implementation of Affordances-Oriented Planning using Foundation Models for Continuous Vision-Language Navigation☆41Updated 8 months ago
- Code for OctoNav-R1☆60Updated 5 months ago
- Official implementation of Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions (Ne…☆47Updated 11 months ago
- PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators☆102Updated last year
- Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling"☆306Updated 3 weeks ago
- Code for "ACG: Action Coherence Guidance for Flow-based VLA Models"☆37Updated 3 weeks ago
- Official PyTorch implementation for ICML 2025 paper: UP-VLA.☆50Updated 5 months ago
- Official implementation of OpenFMNav: Towards Open-Set Zero-Shot Object Navigation via Vision-Language Foundation Models☆53Updated last year