PKU-HMI-Lab/Hybrid-VLA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/PKU-HMI-Lab/Hybrid-VLA)

PKU-HMI-Lab / Hybrid-VLA

HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model

☆352

Alternatives and similar repositories for Hybrid-VLA

Users that are interested in Hybrid-VLA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

OpenDriveLab / UniVLA
View on GitHub
[RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions
☆1,114Nov 19, 2025Updated 8 months ago
microsoft / CogACT
View on GitHub
A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation
☆430Oct 30, 2025Updated 8 months ago
PKU-HMI-Lab / LIFT3D
View on GitHub
[CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation
☆185Jun 20, 2025Updated last year
moojink / openvla-oft
View on GitHub
Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success
☆1,317Sep 9, 2025Updated 10 months ago
CHEN-H01 / Fast-in-Slow
View on GitHub
Fast-in-Slow: A Dual-System Foundation Model Unifying Fast Manipulation within Slow Reasoning
☆159Aug 1, 2025Updated 11 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
juruobenruo / DexVLA
View on GitHub
☆63Apr 15, 2025Updated last year
SpatialVLA / SpatialVLA
View on GitHub
🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.
☆710Jun 23, 2025Updated last year
RoboDita / Dita
View on GitHub
ICCV2025
☆171Dec 10, 2025Updated 7 months ago
hume-vla / hume
View on GitHub
🦾 A Dual-System VLA with System2 Thinking
☆148Aug 21, 2025Updated 11 months ago
Fanqi-Lin / OneTwoVLA
View on GitHub
Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"
☆236May 30, 2025Updated last year
Psi-Robot / DexGraspVLA
View on GitHub
[AAAI'26 Oral] DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping
☆557Aug 10, 2025Updated 11 months ago
LatentActionPretraining / LAPA
View on GitHub
[ICLR 2025] LAPA: Latent Action Pretraining from Videos
☆561Jan 22, 2025Updated last year
thu-ml / RoboticsDiffusionTransformer
View on GitHub
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
☆1,764Jan 21, 2026Updated 6 months ago
PRIME-RL / SimpleVLA-RL
View on GitHub
[ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
☆1,798Jan 6, 2026Updated 6 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
GuanxingLu / vlarl
View on GitHub
Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning.
☆447Nov 8, 2025Updated 8 months ago
Robot-VLAs / RoboVLMs
View on GitHub
☆475Apr 14, 2026Updated 3 months ago
InternRobotics / Seer
View on GitHub
[ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation
☆310Jul 8, 2025Updated last year
thu-ml / RDT2
View on GitHub
Official code of RDT 2
☆795Feb 7, 2026Updated 5 months ago
PKU-EPIC / GraspVLA
View on GitHub
[CoRL25] GraspVLA: a Grasping Foundation Model Pre-trained on Billion-scale Synthetic Action Data
☆390Dec 29, 2025Updated 7 months ago
OpenGalaxea / GalaxeaVLA
View on GitHub
Galaxea's open-source VLA repository
☆698Jul 11, 2026Updated 2 weeks ago
OpenHelix-Team / Awesome-VLA-RL
View on GitHub
This repository summarizes recent advances in the VLA + RL paradigm and provides a taxonomic classification of relevant works.
☆426Oct 10, 2025Updated 9 months ago
starVLA / starVLA
View on GitHub
StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing
☆3,316Jul 20, 2026Updated last week
alibaba-damo-academy / RynnVLA-002
View on GitHub
RynnVLA-002: A Unified Vision-Language-Action and World Model
☆1,103Dec 2, 2025Updated 7 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
X-Square-Robot / wall-x
View on GitHub
Building General-Purpose Robots Based on Embodied Foundation Model
☆1,193Jul 21, 2026Updated last week
ZhuoyangLiu2005 / MLA
View on GitHub
MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation
☆74Nov 10, 2025Updated 8 months ago
simpler-env / SimplerEnv
View on GitHub
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…
☆1,129Dec 20, 2025Updated 7 months ago
OpenMOSS / VLABench
View on GitHub
Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.
☆454Nov 11, 2025Updated 8 months ago
roboterax / video-prediction-policy
View on GitHub
Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations https://video-prediction-policy.github.io
☆408May 17, 2025Updated last year
UMass-Embodied-AGI / 3D-VLA
View on GitHub
[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model
☆629Oct 29, 2024Updated last year
URDF-Anything-plus / Code
View on GitHub
☆44Mar 17, 2026Updated 4 months ago
InternRobotics / InternVLA-M1
View on GitHub
InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy
☆419Feb 11, 2026Updated 5 months ago
horipse01 / 3d-foundation-policy
View on GitHub
☆113Jun 2, 2026Updated last month
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
RoboTwin-Platform / RoboTwin
View on GitHub
[ICML 2026] RoboTwin 2.0 Offical Repo
☆2,647Updated this week
ShuangLI59 / unified_video_action
View on GitHub
Official PyTorch Implementation of Unified Video Action Model (RSS 2025)
☆400Jul 23, 2025Updated last year
openvla / openvla
View on GitHub
OpenVLA: An open-source vision-language-action model for robotic manipulation.
☆6,719Mar 23, 2025Updated last year
OpenDriveLab / AgiBot-World
View on GitHub
[IROS 2025 Best Paper Award Finalist & IEEE TRO 2026] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems
☆3,107May 29, 2026Updated 2 months ago
Physical-Intelligence / real-time-chunking-kinetix
View on GitHub
Simulated experiments for "Real-Time Execution of Action Chunking Flow Policies".
☆551Dec 8, 2025Updated 7 months ago
RLinf / RLinf
View on GitHub
RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI
☆4,291Updated this week
PKU-HMI-Lab / AC-DiT
View on GitHub
AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation
☆48Feb 23, 2026Updated 5 months ago