LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [ICRA 2026]
☆187Mar 12, 2026Updated last month
Alternatives and similar repositories for LLaVA-VLA
Users that are interested in LLaVA-VLA are comparing it to the libraries listed below.
- Official implementation of CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding.☆49Sep 15, 2025Updated 6 months ago
- ☆57Oct 3, 2024Updated last year
- Official implementation of ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver.☆235Apr 1, 2026Updated last week
- [ICCV 23] A Simple Vision Transformer for Weakly Semi-supervised 3D Object Detection☆13Apr 12, 2024Updated 2 years ago
- Evo-0: Vision-Language-Action Model with Implicit Spatial Understanding.☆54Nov 24, 2025Updated 4 months ago
- Score and Distribution Matching Policy: Advanced accelerated Visuomotor Policies via matched distillation☆10May 9, 2025Updated 11 months ago
- GeRM: A Generalist Robotic Model with Mixture-of-Experts for Quadruped Robot https://songwxuan.github.io/GeRM/☆35Apr 29, 2025Updated 11 months ago
- [RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions☆1,049Nov 19, 2025Updated 4 months ago
- ☆15Oct 10, 2024Updated last year
- 🔥 The first open-sourced diffusion vision-language-action model. [ICLR 2026]☆174Mar 12, 2026Updated last month
- [NeurIPS 2025] VLA-Cache: Efficient Vision-Language-Action Manipulation via Adaptive Token Caching☆76Feb 27, 2026Updated last month
- Official implementation of Spatial-Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model [ICLR2026]☆205Mar 12, 2026Updated last month
- Reinforcing Action Policies by Prophesying☆40Nov 26, 2025Updated 4 months ago
- Code for OctoNav-Bench and OctoNav-R1☆66Mar 19, 2026Updated 3 weeks ago
- ☆24Jun 5, 2025Updated 10 months ago
- HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model☆346Oct 3, 2025Updated 6 months ago
- [ICCV 2025] Dense Policy: Bidirectional Autoregressive Learning of Actions DSP☆78Jan 14, 2026Updated 3 months ago
- ReSemAct: Advancing Fine-Grained Robotic Manipulation via Semantic Structuring and Affordance Refinement☆18Jan 5, 2026Updated 3 months ago
- [Actively Maintained🔥] A list of Embodied AI papers accepted by top conferences (ICLR, NeurIPS, ICML, RSS, CoRL, ICRA, IROS, CVPR, ICCV,…☆521Mar 16, 2026Updated 3 weeks ago
- [NeurIPS 2025] Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"☆143Nov 4, 2025Updated 5 months ago
- [ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning☆1,562Jan 6, 2026Updated 3 months ago
- ☆42Mar 26, 2025Updated last year
- ☆13Sep 14, 2024Updated last year
- [ICCV 2025] HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation☆242Jul 14, 2025Updated 8 months ago
- Towards Generalizable Robotic Manipulation in Dynamic Environments☆136Apr 1, 2026Updated last week
- ☆21May 28, 2025Updated 10 months ago
- Dream-VL and Dream-VLA, a diffusion VLM and a diffusion VLA.☆113Jan 14, 2026Updated 3 months ago
- The public reproducible analysis code used for the gaze project☆11Feb 21, 2026Updated last month
- Official Code for SGRv2 and SGR.☆33May 20, 2025Updated 10 months ago
- This is the official repository for VLN-CLASH.☆24Aug 5, 2025Updated 8 months ago
- [NeurIPS 2025] Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence☆457Feb 5, 2026Updated 2 months ago
- [AAAI 2026] WorldRFT: Latent World Model Planning with Reinforcement Fine-Tuning for Autonomous Driving☆35Dec 23, 2025Updated 3 months ago
- ☆27Dec 19, 2025Updated 3 months ago
- [NeurIPS 2025] OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding☆75Sep 29, 2025Updated 6 months ago
- [ICCV 2025] Implementation of the paper "Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs"☆74Oct 25, 2025Updated 5 months ago
- Implementation of PegasusFlow: Parallel Rolling-Denoising Score Sampling for Robot Diffusion Planner Flow Matching☆18Sep 18, 2025Updated 6 months ago
- ☆74Jan 20, 2026Updated 2 months ago
- Pi0-VLA Repository of "MotionTrans: Human VR Data Enable Motion-Level Learning for Robotic Manipulation Policies"☆27Mar 9, 2026Updated last month
- OpenHelix: An Open-source Dual-System VLA Model for Robotic Manipulation☆358Aug 27, 2025Updated 7 months ago