buoyancy99/large-video-planner

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/buoyancy99/large-video-planner)

buoyancy99 / large-video-planner

☆256

Alternatives and similar repositories for large-video-planner

Users that are interested in large-video-planner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ShuangLI59 / unified_video_action
View on GitHub
Official PyTorch Implementation of Unified Video Action Model (RSS 2025)
☆400Jul 23, 2025Updated last year
NVlabs / cosmos-policy
View on GitHub
Cosmos Policy
☆842Jan 23, 2026Updated 6 months ago
dreamzero0 / dreamzero
View on GitHub
Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals
☆2,509Apr 19, 2026Updated 3 months ago
lzylucy / 4dgen
View on GitHub
[ICLR 2026] Codebase for paper "Geometry-aware 4D Video Generation for Robot Manipulation"
☆123Jan 10, 2026Updated 6 months ago
Robbyant / lingbot-va
View on GitHub
[RSS 2026] Causal video-action world model for generalist robot control
☆1,700Jul 9, 2026Updated 3 weeks ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
huangwl18 / PointWorld
View on GitHub
PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation
☆418Mar 11, 2026Updated 4 months ago
GaTech-RL2 / EgoVerse
View on GitHub
EgoVerse: Egocentric Data for Robot Learning from Around the World
☆494Updated this week
UMass-Embodied-AGI / TesserAct
View on GitHub
ICCV 2025 | TesserAct: Learning 4D Embodied World Models
☆404Aug 4, 2025Updated 11 months ago
thu-ml / Motus
View on GitHub
Official code of Motus: A Unified Latent Action World Model
☆1,216Jan 5, 2026Updated 6 months ago
nvidia-cosmos / cosmos-predict2.5
View on GitHub
Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the …
☆1,338Jun 8, 2026Updated last month
Robert-gyj / Ctrl-World
View on GitHub
ICLR 2026 Paper: Ctrl-World
☆538Apr 8, 2026Updated 3 months ago
LatentActionPretraining / LAPA
View on GitHub
[ICLR 2025] LAPA: Latent Action Pretraining from Videos
☆562Jan 22, 2025Updated last year
UT-Austin-RPL / mimicdroid-robocasa
View on GitHub
MimicDroid: In-Context Learning for Humanoid Robot Manipulation from Human Play Videos
☆51Feb 10, 2026Updated 5 months ago
yuantianyuan01 / FastWAM
View on GitHub
Official codebase for Fast-WAM: Do World Action Models Need Test-time Future Imagination?
☆1,225Apr 3, 2026Updated 3 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
NVIDIA / DreamDojo
View on GitHub
Official Codebase for "DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos" (ICML 2026)
☆1,023Mar 21, 2026Updated 4 months ago
facebookresearch / AINA
View on GitHub
Official implementation of Dexterity from Smart Lenses Multi-Fingered Robot Manipulation with In-the-Wild Human Demonstrations. Project w…
☆58Dec 26, 2025Updated 7 months ago
facebookresearch / Action100M
View on GitHub
A Large-scale Video Action Dataset
☆483Jan 16, 2026Updated 6 months ago
RogerQi / human-policy
View on GitHub
☆257May 12, 2025Updated last year
NVIDIA / GR00T-Dreams
View on GitHub
DreamGen: Nvidia GEAR Lab's initiative to solve the robotics data problem using world models
☆592Oct 24, 2025Updated 9 months ago
Spirit-AI-Team / spirit-v1.5
View on GitHub
Spirit-v1.5: A Robotic Foundation Model by Spirit AI
☆631May 29, 2026Updated 2 months ago
xizaoqu / WorldMem
View on GitHub
[NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory
☆381Feb 21, 2026Updated 5 months ago
world-model-eval / world-model-eval
View on GitHub
Code for "Evaluating Robot Policies in a World Model".
☆101Nov 6, 2025Updated 8 months ago
apple / ml-egodex
View on GitHub
EgoDex: Learning Dexterous Manipulation from Large-Scale Egocentric Video
☆350Aug 20, 2025Updated 11 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Little-Podi / AdaWorld
View on GitHub
[ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".
☆254Jun 17, 2025Updated last year
WangYixuan12 / interactive_world_sim
View on GitHub
[RSS 2026] Interactive World Simulator for Robot Policy Training and Evaluation
☆281Jun 4, 2026Updated last month
lucidrains / mimic-video
View on GitHub
Implementation of Mimic-Video, Video-Action Models for SOTA Generalizable Robot Control Beyond VLAs
☆119Updated this week
WEIRDLabUW / unified-world-model
View on GitHub
Unfied World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets
☆247Oct 8, 2025Updated 9 months ago
malik-group / do-as-i-do
View on GitHub
Official Codebase for "Do as I Do: Dexterous Manipulation Data from Everyday Human Videos"
☆336Jul 22, 2026Updated last week
facebookresearch / spider
View on GitHub
A general physic-based retargeting framework.
☆510Jun 30, 2026Updated last month
kwsong0113 / diffusion-forcing-transformer
View on GitHub
[ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"
☆705Jul 1, 2025Updated last year
World-In-World / world-in-world
View on GitHub
"World Models in a Closed-Loop World" (ICLR'26 Oral)
☆181Apr 3, 2026Updated 3 months ago
cvlab-columbia / videopolicy
View on GitHub
☆64Mar 3, 2026Updated 4 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
eldar / vdpm
View on GitHub
Official implementation of Video-DPM
☆242Jan 19, 2026Updated 6 months ago
tsinghua-fib-lab / WorldArena
View on GitHub
WorldArena: A Unified Benchmark for Evaluating Perception and Functional Utility of Embodied World Models
☆250Jul 22, 2026Updated last week
thuml / Vid2World
View on GitHub
Official repository for "Vid2World: Crafting Video Diffusion Models to Interactive World Models" (ICLR 2026), https://arxiv.org/abs/2505.…
☆70Jan 27, 2026Updated 6 months ago
sihengz02 / RoLA
View on GitHub
[CoRL 2025] Robot Learning from Any Images
☆34Nov 11, 2025Updated 8 months ago
video-to-action / video-to-action-release
View on GitHub
[ICLR 2025 Spotlight] Grounding Video Models to Actions through Goal Conditioned Exploration
☆62May 4, 2025Updated last year
ThunderVVV / HaWoR
View on GitHub
HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos
☆309Apr 16, 2026Updated 3 months ago
shivanshpatel35 / rigvid
View on GitHub
☆62Jul 4, 2025Updated last year