Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets
☆201 · Oct 8, 2025 · Updated 5 months ago
Alternatives and similar repositories for unified-world-model
Users who are interested in unified-world-model are comparing it to the repositories listed below.
- Official PyTorch Implementation of Unified Video Action Model (RSS 2025) · ☆350 · Jul 23, 2025 · Updated 8 months ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks · ☆79 · Dec 12, 2024 · Updated last year
- ☆40 · Mar 26, 2025 · Updated last year
- Code for Point Policy: Unifying Observations and Actions with Key Points for Robot Manipulation · ☆89 · Jul 21, 2025 · Updated 8 months ago
- ICCV2025 · ☆163 · Dec 10, 2025 · Updated 3 months ago
- ☆90 · Sep 23, 2025 · Updated 6 months ago
- ☆14 · Feb 13, 2025 · Updated last year
- ☆64 · Sep 18, 2025 · Updated 6 months ago
- [ICRA 2025] In-Context Imitation Learning via Next-Token Prediction · ☆110 · Mar 17, 2025 · Updated last year
- ☆24 · Jun 11, 2025 · Updated 9 months ago
- [CoRL 2024] Im2Flow2Act: Flow as the Cross-domain Manipulation Interface · ☆155 · Oct 17, 2024 · Updated last year
- ☆100 · Sep 5, 2024 · Updated last year
- Hand-object interaction Pretraining From Videos · ☆116 · Aug 26, 2025 · Updated 7 months ago
- Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations https://video-prediction-policy.github.io · ☆369 · May 17, 2025 · Updated 10 months ago
- Official implementation of Crossing the Human-Robot Embodiment Gap with Sim-to-Real RL using One Human Demonstration · ☆144 · Feb 8, 2026 · Updated last month
- RynnVLA-002: A Unified Vision-Language-Action and World Model · ☆955 · Dec 2, 2025 · Updated 3 months ago
- Code for "Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model" · ☆109 · Oct 24, 2025 · Updated 5 months ago
- ☆23 · Mar 4, 2026 · Updated 3 weeks ago
- Official repository for LeLaN training and inference code · ☆132 · Sep 27, 2024 · Updated last year
- Code for "Scalable Real2Sim: Physics-Aware Asset Generation Via Robotic Pick-and-Place Setups", IROS 2025 · ☆119 · Mar 14, 2026 · Updated last week
- Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success · ☆1,094 · Sep 9, 2025 · Updated 6 months ago
- ☆19 · Jun 26, 2024 · Updated last year
- MAPLE infuses dexterous manipulation priors from egocentric videos into vision encoders, making their features well-suited for downstream… · ☆30 · Dec 9, 2025 · Updated 3 months ago
- Official code for "QueST: Self-Supervised Skill Abstractions for Continuous Control" [NeurIPS 2024] · ☆108 · Nov 21, 2024 · Updated last year
- ☆35 · Mar 11, 2025 · Updated last year
- [ICRA 2026] Re3Sim: Generating High-Fidelity Simulation Data via 3D-Photorealistic Real-to-Sim for Robotic Manipulation · ☆135 · Mar 16, 2026 · Updated last week
- ☆92 · Feb 13, 2025 · Updated last year
- RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning · ☆1,688 · Mar 16, 2026 · Updated last week
- [CVPR 2025] 🎉 Official repository of "ManipTrans: Efficient Dexterous Bimanual Manipulation Transfer via Residual Learning" · ☆296 · Oct 10, 2025 · Updated 5 months ago
- [ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos · ☆167 · Oct 1, 2025 · Updated 5 months ago
- ☆96 · Sep 4, 2024 · Updated last year
- Official PyTorch implementation for NeurIPS 2024 paper: Prediction with Action. · ☆49 · Jan 4, 2025 · Updated last year
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos · ☆492 · Jan 22, 2025 · Updated last year
- AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation · ☆37 · Feb 23, 2026 · Updated last month
- Official implementation of Points2Plans: From Point Clouds to Long-Horizon Plans with Composable Relational Dynamics · ☆41 · Mar 11, 2025 · Updated last year
- Official repository of Learning to Act from Actionless Videos through Dense Correspondences. · ☆251 · Apr 25, 2024 · Updated last year
- ☆46 · Apr 2, 2025 · Updated 11 months ago
- Official Repo for the paper "Learning Visual Parkour from Generated Images" (CoRL 2024). · ☆154 · Nov 15, 2024 · Updated last year
- ☆80 · Oct 21, 2024 · Updated last year