Zhangwenyao1/DreamVLA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Zhangwenyao1/DreamVLA)

Zhangwenyao1 / DreamVLA

[NeurIPS 2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge

☆364

Alternatives and similar repositories for DreamVLA

Users that are interested in DreamVLA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

alibaba-damo-academy / RynnVLA-002
View on GitHub
RynnVLA-002: A Unified Vision-Language-Action and World Model
☆1,104Dec 2, 2025Updated 7 months ago
OpenDriveLab / UniVLA
View on GitHub
[RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions
☆1,115Nov 19, 2025Updated 8 months ago
InternRobotics / Seer
View on GitHub
[ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation
☆310Jul 8, 2025Updated last year
moojink / openvla-oft
View on GitHub
Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success
☆1,320Sep 9, 2025Updated 10 months ago
thu-ml / Motus
View on GitHub
Official code of Motus: A Unified Latent Action World Model
☆1,216Jan 5, 2026Updated 6 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
SpatialVLA / SpatialVLA
View on GitHub
🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.
☆711Jun 23, 2025Updated last year
PRIME-RL / SimpleVLA-RL
View on GitHub
[ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
☆1,799Jan 6, 2026Updated 6 months ago
LatentActionPretraining / LAPA
View on GitHub
[ICLR 2025] LAPA: Latent Action Pretraining from Videos
☆562Jan 22, 2025Updated last year
ginwind / VLA-JEPA
View on GitHub
[ECCV 2026] VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model
☆511May 2, 2026Updated 2 months ago
ShuangLI59 / unified_video_action
View on GitHub
Official PyTorch Implementation of Unified Video Action Model (RSS 2025)
☆400Jul 23, 2025Updated last year
Lifelong-Robot-Learning / LIBERO
View on GitHub
Benchmarking Knowledge Transfer in Lifelong Robot Learning
☆2,115Mar 15, 2025Updated last year
Robbyant / lingbot-va
View on GitHub
[RSS 2026] Causal video-action world model for generalist robot control
☆1,700Jul 9, 2026Updated 3 weeks ago
starVLA / starVLA
View on GitHub
StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing
☆3,340Updated this week
NVlabs / cosmos-policy
View on GitHub
Cosmos Policy
☆842Jan 23, 2026Updated 6 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
baaivision / UniVLA
View on GitHub
[ICLR 2026] Unified Vision-Language-Action Model
☆316Oct 15, 2025Updated 9 months ago
hume-vla / hume
View on GitHub
🦾 A Dual-System VLA with System2 Thinking
☆148Aug 21, 2025Updated 11 months ago
OpenHelix-Team / Spatial-Forcing
View on GitHub
Official implementation of Spatial-Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model [ICLR2026]
☆271Jul 7, 2026Updated 3 weeks ago
SJTU-DENG-Lab / Mantis
View on GitHub
[CVPR 2026] Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight
☆92Jun 5, 2026Updated last month
dreamzero0 / dreamzero
View on GitHub
Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals
☆2,509Apr 19, 2026Updated 3 months ago
HeegerGao / VLA-OS
View on GitHub
Official Code For VLA-OS.
☆145Jun 25, 2025Updated last year
mees / calvin
View on GitHub
CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
☆963Sep 8, 2025Updated 10 months ago
InternRobotics / InternVLA-M1
View on GitHub
InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy
☆418Feb 11, 2026Updated 5 months ago
simpler-env / SimplerEnv
View on GitHub
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…
☆1,132Dec 20, 2025Updated 7 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
roboterax / video-prediction-policy
View on GitHub
Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations https://video-prediction-policy.github.io
☆407May 17, 2025Updated last year
LogosRoboticsGroup / 4D-VLA
View on GitHub
4D-VLA: Spatiotemporal Vision-Language-Action Pretraining with Cross-Scene Calibration. Accepted to NeurIPS 2025.
☆58Jan 10, 2026Updated 6 months ago
gen-robot / RL4VLA
View on GitHub
☆279Aug 25, 2025Updated 11 months ago
thu-ml / RDT2
View on GitHub
Official code of RDT 2
☆797Feb 7, 2026Updated 5 months ago
OpenHelix-Team / ReconVLA
View on GitHub
Official implementation of ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver.
☆269Apr 1, 2026Updated 4 months ago
BridgeVLA / BridgeVLA
View on GitHub
✨✨【NeurIPS 2025】Official implementation of BridgeVLA
☆194Apr 5, 2026Updated 3 months ago
yuantianyuan01 / FastWAM
View on GitHub
Official codebase for Fast-WAM: Do World Action Models Need Test-time Future Imagination?
☆1,225Apr 3, 2026Updated 3 months ago
2toinf / X-VLA
View on GitHub
[ICLR 2026] The offical Implementation of "Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model"
☆697Jun 10, 2026Updated last month
microsoft / CogACT
View on GitHub
A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation
☆429Oct 30, 2025Updated 9 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
PKU-HMI-Lab / Hybrid-VLA
View on GitHub
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model
☆352Oct 3, 2025Updated 9 months ago
TencentARC / Moto
View on GitHub
[ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos
☆180Oct 1, 2025Updated 10 months ago
qizekun / SoFar
View on GitHub
[NeurIPS 2025 Spotlight] SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
☆246Jun 30, 2025Updated last year
RoboTwin-Platform / RoboTwin
View on GitHub
[ICML 2026] RoboTwin 2.0 Offical Repo
☆2,659Updated this week
PKU-EPIC / GraspVLA
View on GitHub
[CoRL25] GraspVLA: a Grasping Foundation Model Pre-trained on Billion-scale Synthetic Action Data
☆390Dec 29, 2025Updated 7 months ago
InternRobotics / VLAC
View on GitHub
VLAC: A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning
☆321Jul 13, 2026Updated 2 weeks ago
GigaAI-research / VLA-R1
View on GitHub
☆74Jun 18, 2026Updated last month