OpenHelix-Team / Unified-Diffusion-VLALinks
π₯ The first open-sourced diffusion vision-langauge-action model.
β134Updated this week
Alternatives and similar repositories for Unified-Diffusion-VLA
Users that are interested in Unified-Diffusion-VLA are comparing it to the libraries listed below
Sorting:
- Official implementation for "HA-VLN: A Benchmark for Human-Aware Navigation in Discrete-Continuous Environments with Dynamic Multi-Human β¦β368Updated last month
- GigaBrain-0: A World Model-Powered Vision-Language-Action Modelβ603Updated 2 weeks ago
- GigaWorld-0: World Models as Data Engine to Empower Embodied AIβ605Updated last week
- Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation"β259Updated last month
- Awesome collection of resources and papers on Diffusion Models for Robotic Manipulation.β735Updated 3 months ago
- β545Updated last month
- β248Updated 11 months ago
- VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Modelβ1,784Updated 3 weeks ago
- [NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3Dβ191Updated last week
- RealMirror, a comprehensive, open-source embodied AI VLA platform.β104Updated this week
- β293Updated 2 months ago
- DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Modelsβ450Updated this week
- SCFlow2: Plug-and-Play Object Pose Refiner with Shape-Constraint Scene Flow, CVPR2025β53Updated 3 months ago
- [NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understandingβ341Updated last month
- [TRO 2024] Grasp, See and Place: Efficient Unknown Object Rearrangement with Policy Structure Priorβ72Updated 8 months ago
- GigaTrain: An Efficient and Scalable Training Framework for AI Modelsβ252Updated last week
- π₯ [AAAI 2026 Oral] Official code for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptatβ¦β72Updated last year
- β94Updated 5 months ago
- β34Updated last year
- CoNav : Collaborative Cross-Modal Reasoning for Embodied Navigationβ17Updated 6 months ago
- Embodied Co-Design for Rapidly Evolving Agents: Taxonomy, Frontiers, and Challengesβ255Updated last week
- β122Updated 3 weeks ago
- [CoRL2024] Let Occ Flow: Self-Supervised 3D Occupancy Flow Predictionβ126Updated 2 months ago
- [AAAI 2026 Oral] LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequencesβ176Updated last week
- [ICRA 2025] PUGS: Zero-shot Physical Understanding with Gaussian Splatting.β102Updated 8 months ago
- This is the source code for the ECCV paper "MTFormer: Multi-Task Learning via Transformer and Cross-Task Reasoning"β200Updated 3 years ago
- Autonomous Implicit Indoor Scene Reconstruction with Frontier Exploration οΌICRA 2024οΌβ50Updated last year
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Modelsβ214Updated last month
- This repository contains the source code for our paper: "PrefMMT: Modeling Human Preferences in Preference-based Reinforcement Learning wβ¦β50Updated 9 months ago
- Mem4Nav: Boosting Vision-and-Language Navigation in Urban Environments with a Hierarchical Spatial-Cognition Long-Short Memory Systemβ101Updated 4 months ago