roboterax / video-prediction-policyLinks

Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations https://video-prediction-policy.github.io

☆314

Alternatives and similar repositories for video-prediction-policy

Users that are interested in video-prediction-policy are comparing it to the libraries listed below

Sorting:

2toinf / UniAct
[CVPR 2025] The offical Implementation of "Universal Actions for Enhanced Embodied Foundation Models"
☆220Updated last month
OpenGalaxea / G0
Galaxea's first VLA release
☆330Updated 2 months ago
alibaba-damo-academy / RynnVLA-001
RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation
☆271Updated 3 weeks ago
GuanxingLu / vlarl
Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning.
☆370Updated last month
Robot-VLAs / RoboVLMs
☆417Updated 3 weeks ago
AgibotTech / Genie-Envisioner
☆342Updated this week
2toinf / X-VLA
The offical Implementation of "Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model"
☆378Updated 3 weeks ago
OpenHelix-Team / OpenHelix
OpenHelix: An Open-source Dual-System VLA Model for Robotic Manipulation
☆331Updated 4 months ago
PKU-HMI-Lab / Hybrid-VLA
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model
☆330Updated 2 months ago
bytedance / GR-1
Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"
☆296Updated last year
microsoft / CogACT
A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation
☆393Updated last month
Fanqi-Lin / OneTwoVLA
Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"
☆205Updated 6 months ago
Dexmal / dexbotic
Dexbotic: Open-Source Vision-Language-Action Toolbox
☆615Updated this week
SpatialVLA / SpatialVLA
🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.
☆600Updated 6 months ago
starVLA / starVLA
StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing
☆629Updated this week
InternRobotics / InternManip
An All-in-one robot manipulation learning suite for policy models training and evaluation on various datasets and benchmarks.
☆166Updated 2 months ago
HeegerGao / VLA-OS
Official Code For VLA-OS.
☆132Updated 6 months ago
PKU-EPIC / GraspVLA
GraspVLA: a Grasping Foundation Model Pre-trained on Billion-scale Synthetic Action Data
☆300Updated 5 months ago
ShuangLI59 / unified_video_action
Official PyTorch Implementation of Unified Video Action Model (RSS 2025)
☆309Updated 5 months ago
InternRobotics / Seer
[ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation
☆270Updated 5 months ago
Psi-Robot / DexGraspVLA
[AAAI'26 Oral] DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping
☆451Updated 4 months ago
thu-ml / RDT2
Official code of RDT 2
☆605Updated 3 weeks ago
InternRobotics / VLAC
VLAC: A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning
☆249Updated 3 months ago
cccedric / conrft
This is the official implementation of the paper "ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy".
☆304Updated last month
Fanqi-Lin / Data-Scaling-Laws
Official implementation of "Data Scaling Laws in Imitation Learning for Robotic Manipulation"
☆197Updated last year
X-Square-Robot / wall-x
Building General-Purpose Robots Based on Embodied Foundation Model
☆638Updated 2 weeks ago
OpenMOSS / VLABench
Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.
☆350Updated last month
ZibinDong / openpi_pytorch
Pytorch PI-zero and PI-zero-fast. Adapted from LeRobot
☆163Updated 3 months ago
unitreerobotics / unifolm-world-model-action
☆749Updated 2 months ago
RoboDita / Dita
ICCV2025
☆145Updated 2 weeks ago