π₯ The first open-sourced diffusion vision-langauge-action model. [ICLR 2026]
β165Mar 12, 2026Updated last week
Alternatives and similar repositories for Unified-Diffusion-VLA
Users that are interested in Unified-Diffusion-VLA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Score and Distribution Matching Policy: Advanced accelerated Visuomotor Policies via matched distillationβ10May 9, 2025Updated 10 months ago
- Official implementation of CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding.β49Sep 15, 2025Updated 6 months ago
- [CVPR 2026] HiF-VLA: An efficient, bidirectional spatiotemporal expansion Vision-Language-Action Modelβ50Mar 11, 2026Updated last week
- Gaussian Splatting for Robotic Simulationβ22Nov 7, 2025Updated 4 months ago
- MoM: Mixtures of Scenario-Aware Document Memories for Retrieval-Augmented Generation Systemsβ43Oct 17, 2025Updated 5 months ago
- [NeurIPS'25] SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoningβ40Oct 14, 2025Updated 5 months ago
- [AAAI 2026] Official code for MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manβ¦β64Jul 31, 2025Updated 7 months ago
- Official implementation of ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver.β228Updated this week
- LITEN: Learning from Inference Time Execution for VLAsβ27Oct 23, 2025Updated 5 months ago
- LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [ICRA 2026]β185Mar 12, 2026Updated last week
- Official implementation of Spatial-Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model [ICLR2026]β193Mar 12, 2026Updated last week
- Official implementation of FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignmentβ35Feb 24, 2026Updated last month
- VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Modelβ2,052Updated this week
- The official repo for the DanQing dataset.β32Jan 16, 2026Updated 2 months ago
- [AAAI 2026 Oral] Harnessing Vision-Language Models for Time Series Anomaly Detectionβ71Feb 13, 2026Updated last month
- The offical repo for "Play to the Score: Stage-Guided Dynamic Multi-Sensory Fusion for Robotic Manipulation", CoRL 2024 (ORAL)β20Jun 25, 2025Updated 8 months ago
- [NeurIPS 2025] Official code repository for "Failure Prediction at Runtime for Generative Robot Policies".β33Nov 3, 2025Updated 4 months ago
- Minimalist ML framework for Go.β456Dec 29, 2025Updated 2 months ago
- unifai-sdk-py is the Python SDK for Unifai, an AI native platform for dynamic tools and agent to agent communication.β139Jan 26, 2026Updated last month
- GeRM: A Generalist Robotic Model with Mixture-of-Experts for Quadruped Robot https://songwxuan.github.io/GeRM/β35Apr 29, 2025Updated 10 months ago
- This is a Spring Cloud project that integrates with AI Front-end vue3 Backend Spring Cloud Main function: It primarily achieves emotionalβ¦β40Jan 16, 2026Updated 2 months ago
- β121Dec 17, 2025Updated 3 months ago
- EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memoryβ61Jan 13, 2026Updated 2 months ago
- Official code for paper Advantage Weighted Matching: Aligning RL with Pretraining in Diffusion Modelsβ59Jan 16, 2026Updated 2 months ago
- MSWALβ14Nov 7, 2025Updated 4 months ago
- LUMOS: Language-Conditioned Imitation Learning with World Modelsβ16Mar 4, 2026Updated 2 weeks ago
- Open-Tax is an AI-powered cloud platform transforming tax compliance through automated data integration, real-time anomaly detection, andβ¦β410Feb 3, 2025Updated last year
- This is the code for Hyperspectral Anomaly Detection With Guided Autoencoder.β39Nov 19, 2022Updated 3 years ago
- Code Release for NeurIPS 2025, "COS3D: Collaborative Open-Vocabulary 3D Segmentation"β16Dec 21, 2025Updated 3 months ago
- Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding (CVPR 2025 Oral)β38Nov 28, 2025Updated 3 months ago
- [ICLR 2026] NewtonGen: Physics-Consistent and Controllable Text-to-Video Generation via Neural Newtonian Dynamicsβ124Mar 17, 2026Updated last week
- A comprehensive toolkit for time series analysis, including scripts for visualizing results, detecting stationarity, trends, seasonality,β¦β87Mar 4, 2026Updated 2 weeks ago
- [ICRA 2024] WLST: Weak Labels Guided Self-training for Weakly-supervised Domain Adaptation on 3D Object Detectionβ12Feb 6, 2024Updated 2 years ago
- β49Feb 22, 2025Updated last year
- Official codebase for PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priorsβ45Oct 1, 2024Updated last year
- π Production-ready C++ framework: Build games, trading systems & network apps in minutes. Qt6 | libuv | C++23 | Docker Readyβ23Feb 12, 2026Updated last month
- Sim-Suction-API offers a simulation framework to generate synthetic data and train models for robotic suction grasping in cluttered envirβ¦β46Nov 9, 2023Updated 2 years ago
- β170Mar 12, 2026Updated last week
- [IROS 2025] Adaptive Visuo-Tactile Fusion with Predictive Force Attention for Dexterous Manipulationβ21Mar 1, 2026Updated 3 weeks ago