ShuangLI59/unified_video_action

Official PyTorch Implementation of Unified Video Action Model (RSS 2025)

☆400

Alternatives and similar repositories for unified_video_action

Users who are interested in unified_video_action are comparing it to the libraries listed below.

WEIRDLabUW / unified-world-model
Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets
☆246 · Oct 8, 2025 · Updated 9 months ago
NVlabs / cosmos-policy
Cosmos Policy
☆840 · Jan 23, 2026 · Updated 6 months ago
LatentActionPretraining / LAPA
[ICLR 2025] LAPA: Latent Action Pretraining from Videos
☆562 · Jan 22, 2025 · Updated last year
roboterax / video-prediction-policy
Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations https://video-prediction-policy.github.io
☆407 · May 17, 2025 · Updated last year
buoyancy99 / large-video-planner
☆256 · Jan 31, 2026 · Updated 5 months ago
lzylucy / 4dgen
[ICLR 2026] Codebase for paper "Geometry-aware 4D Video Generation for Robot Manipulation"
☆123 · Jan 10, 2026 · Updated 6 months ago
TEA-Lab / DemoGen
[RSS25] Official implementation of DemoGen: Synthetic Demonstration Generation for Data-Efficient Visuomotor Policy Learning
☆252 · Jul 18, 2025 · Updated last year
Robbyant / lingbot-va
[RSS 2026] Causal video-action world model for generalist robot control
☆1,695 · Jul 9, 2026 · Updated 2 weeks ago
Robert-gyj / Ctrl-World
ICLR 2026 Paper: Ctrl-World
☆538 · Apr 8, 2026 · Updated 3 months ago
simpler-env / SimplerEnv
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…
☆1,130 · Dec 20, 2025 · Updated 7 months ago
dreamzero0 / dreamzero
Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals
☆2,504 · Apr 19, 2026 · Updated 3 months ago
thu-ml / RDT2
Official code of RDT 2
☆795 · Feb 7, 2026 · Updated 5 months ago
YanjieZe / 3D-Diffusion-Policy
[RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations
☆1,415 · Oct 17, 2025 · Updated 9 months ago
alibaba-damo-academy / RynnVLA-002
RynnVLA-002: A Unified Vision-Language-Action and World Model
☆1,103 · Dec 2, 2025 · Updated 7 months ago
yuantianyuan01 / FastWAM
Official codebase for Fast-WAM: Do World Action Models Need Test-time Future Imagination?
☆1,221 · Apr 3, 2026 · Updated 3 months ago
HeegerGao / FLIP
Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks
☆85 · Dec 12, 2024 · Updated last year
OpenDriveLab / UniVLA
[RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions
☆1,114 · Nov 19, 2025 · Updated 8 months ago
robocasa / robocasa
RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots
☆1,592 · Jul 8, 2026 · Updated 3 weeks ago
RoboVerseOrg / RoboVerse
RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
☆1,791 · Updated this week
flow-diffusion / AVDC
Official repository of Learning to Act from Actionless Videos through Dense Correspondences.
☆262 · Apr 25, 2024 · Updated 2 years ago
TencentARC / Moto
[ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos
☆180 · Oct 1, 2025 · Updated 9 months ago
RogerQi / human-policy
☆257 · May 12, 2025 · Updated last year
NVIDIA / GR00T-Dreams
DreamGen: Nvidia GEAR Lab's initiative to solve the robotics data problem using world models
☆592 · Oct 24, 2025 · Updated 9 months ago
AgibotTech / Genie-Envisioner-V1
☆565 · Jun 24, 2026 · Updated last month
Little-Podi / AdaWorld
[ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".
☆254 · Jun 17, 2025 · Updated last year
PRIME-RL / SimpleVLA-RL
[ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
☆1,798 · Jan 6, 2026 · Updated 6 months ago
thu-ml / RoboticsDiffusionTransformer
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
☆1,766 · Jan 21, 2026 · Updated 6 months ago
UMass-Embodied-AGI / TesserAct
ICCV 2025 | TesserAct: Learning 4D Embodied World Models
☆403 · Aug 4, 2025 · Updated 11 months ago
Max-Fu / icrt
[ICRA 2025] In-Context Imitation Learning via Next-Token Prediction
☆120 · Mar 17, 2025 · Updated last year
real-stanford / diffusion_policy
[RSS 2023] Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
☆4,419 · Dec 24, 2024 · Updated last year
huangwl18 / ReKep
ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation
☆976 · Feb 20, 2025 · Updated last year
YanjieZe / Improved-3D-Diffusion-Policy
[IROS 2025] Generalizable Humanoid Manipulation with 3D Diffusion Policies. Part 1: Train & Deploy of iDP3
☆552 · Jun 16, 2025 · Updated last year
OpenDriveLab / AgiBot-World
[IROS 2025 Best Paper Award Finalist & IEEE TRO 2026] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems
☆3,111 · May 29, 2026 · Updated 2 months ago
OpenMOSS / VLABench
Official repo of VLABench, a large-scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.
☆455 · Nov 11, 2025 · Updated 8 months ago
lyttttt3333 / CodeDiffuser
☆38 · Jun 19, 2025 · Updated last year
NVIDIA / DreamDojo
Official Codebase for "DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos" (ICML 2026)
☆1,023 · Mar 21, 2026 · Updated 4 months ago
Lifelong-Robot-Learning / LIBERO
Benchmarking Knowledge Transfer in Lifelong Robot Learning
☆2,110 · Mar 15, 2025 · Updated last year
RoboDita / Dita
ICCV2025
☆171 · Dec 10, 2025 · Updated 7 months ago
real-stanford / im2Flow2Act
[CoRL 2024] Im2Flow2Act: Flow as the Cross-domain Manipulation Interface
☆161 · Oct 17, 2024 · Updated last year