🔥 The first open-sourced diffusion vision-language-action model.
☆163Jan 8, 2026Updated last month
Alternatives and similar repositories for Unified-Diffusion-VLA
Users who are interested in Unified-Diffusion-VLA are comparing it to the repositories listed below
- LITEN: Learning from Inference Time Execution for VLAs☆26Oct 23, 2025Updated 4 months ago
- [ICRA 2024] WLST: Weak Labels Guided Self-training for Weakly-supervised Domain Adaptation on 3D Object Detection☆12Feb 6, 2024Updated 2 years ago
- Official implementation of CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding.☆48Sep 15, 2025Updated 5 months ago
- [NeurIPS'25] SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning☆41Oct 14, 2025Updated 4 months ago
- Efficient Decoupled Feature 3D Gaussian Splatting via Hierarchical Compression☆12Mar 17, 2025Updated 11 months ago
- Score and Distribution Matching Policy: Advanced accelerated Visuomotor Policies via matched distillation☆10May 9, 2025Updated 9 months ago
- Official implementation of ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver.☆216Updated this week
- ☆13Nov 14, 2023Updated 2 years ago
- The official repo for "Play to the Score: Stage-Guided Dynamic Multi-Sensory Fusion for Robotic Manipulation", CoRL 2024 (ORAL)☆19Jun 25, 2025Updated 8 months ago
- Semantic-Geometric-Physical-Driven Robot Manipulation Skill Transfer via Skill Library and Tactile Representation☆14Jan 31, 2026Updated last month
- This is the code related to "Zero-Shot Point Cloud Segmentation by Semantic-Visual Aware Synthesis" (ICCV 2023)☆17Dec 15, 2023Updated 2 years ago
- Official implementation of LLM+MAP: Bimanual Robot Task Planning using Large Language Models (LLMs) and Planning Domain Definition Langua…☆20Mar 24, 2025Updated 11 months ago
- LUMOS: Language-Conditioned Imitation Learning with World Models☆16Updated this week
- LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [Actively Maintained🔥]☆178Oct 29, 2025Updated 4 months ago
- HiF-VLA: An efficient, bidirectional spatiotemporal expansion Vision-Language-Action Model☆47Dec 11, 2025Updated 2 months ago
- This repository contains benchmarking code for the ICRA 2023 submission titled Multi-Contact Task and Motion Planning Guided by Video Dem…☆14Apr 20, 2025Updated 10 months ago
- [ICRA 25] FLaRe: Achieving Masterful and Adaptive Robot Policies with Large-Scale Reinforcement Learning Fine-Tuning☆44Jan 5, 2025Updated last year
- AnyPos: Automated Task-Agnostic Actions for Bimanual Manipulation☆35Jul 25, 2025Updated 7 months ago
- ☆23Dec 31, 2024Updated last year
- Sim-Suction-API offers a simulation framework to generate synthetic data and train models for robotic suction grasping in cluttered envir…☆46Nov 9, 2023Updated 2 years ago
- N2M: Bridging Navigation and Manipulation by Learning Initial Pose Preference from Rollout☆28Nov 21, 2025Updated 3 months ago
- DiffuBox: Refining 3D Object Detection with Point Diffusion☆20Mar 9, 2025Updated 11 months ago
- Official implementation of Spatial-Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model☆185Jan 8, 2026Updated last month
- [AAAI 2026] Official code for MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Man…☆65Jul 31, 2025Updated 7 months ago
- Official codebase for PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors☆45Oct 1, 2024Updated last year
- SceneComplete: Open-World 3D Scene Completion in Complex Real World Environments for Robot Manipulation☆25Jan 18, 2026Updated last month
- Official Release of "Mixture of Horizons in Action Chunking"☆41Dec 3, 2025Updated 3 months ago
- Official code repository for CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models☆25Sep 26, 2025Updated 5 months ago
- OpenHelix: An Open-source Dual-System VLA Model for Robotic Manipulation☆346Aug 27, 2025Updated 6 months ago
- EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory☆62Jan 13, 2026Updated last month
- This repository contains the code for our ICML 2025 paper, LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection🎉☆26May 29, 2025Updated 9 months ago
- Simplifying diffusion/flow policies by treating action trajectories as flow trajectories☆94Oct 15, 2025Updated 4 months ago
- Code for our ICRA 2024 paper on learning diverse skills☆26Apr 6, 2024Updated last year
- [ICRA 2023] Sim2Real^2: Actively Building Explicit Physics Model for Precise Articulated Object Manipulation☆23Aug 21, 2023Updated 2 years ago
- Training recipe for SpatialReasoner☆38Sep 21, 2025Updated 5 months ago
- [IROS 2024] Lightweight Language-driven Grasp Detection using Conditional Consistency Model☆28Aug 14, 2024Updated last year
- [CoRL 2025] Robot Learning from Any Images☆34Nov 11, 2025Updated 3 months ago
- Official Code Release of NeurIPS 2025 Paper: HoloScene: Simulation‑Ready Interactive 3D Worlds from a Single Video☆90Oct 8, 2025Updated 4 months ago
- [ICRA 2024] Official Implementation of the paper "Parameter-efficient Prompt Learning for 3D Point Cloud Understanding"☆28Feb 24, 2025Updated last year