Official repository for "Vid2World: Crafting Video Diffusion Models to Interactive World Models" (ICLR 2026), https://arxiv.org/abs/2505.14357
☆41Jan 27, 2026Updated last month
Alternatives and similar repositories for Vid2World
Users that are interested in Vid2World are comparing it to the libraries listed below
Sorting:
- Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer☆28Nov 4, 2025Updated 4 months ago
- Galaxea's first diffusion policy release☆38Aug 18, 2025Updated 6 months ago
- [CoRL 2025] Robot Learning from Any Images☆34Nov 11, 2025Updated 3 months ago
- [ICCV'25] Towards Scalable Gaussian World Models for Robotic Manipulation☆91Oct 13, 2025Updated 4 months ago
- [ICRA 2026] 🌠 DSPv2: Improved Dense Policy for Effective and Generalizable Whole-body Mobile Manipulation☆29Jan 14, 2026Updated last month
- ☆13Jun 1, 2023Updated 2 years ago
- [ICCV'25] ScenePainter: Semantically Consistent Perpetual 3D Scene Generation with Concept Relation Alignment☆36Oct 5, 2025Updated 5 months ago
- ☆32Feb 4, 2026Updated last month
- A data collection and processing pipeline for animal video, annotations include mask, keypoint, depth, occlusion, etc. Suitable for 3D/4D…☆46Dec 5, 2025Updated 3 months ago
- Wasserstein Gaussian Splatting☆17Dec 10, 2024Updated last year
- PISCO: Precise Video Instance Insertion with Sparse Control☆48Feb 13, 2026Updated 2 weeks ago
- Official Implementation of "Align-Then-stEer: Adapting the Vision-Language Action Models through Unified Latent Guidance".☆64Oct 16, 2025Updated 4 months ago
- AnyPos: Automated Task-Agnostic Actions for Bimanual Manipulation☆35Jul 25, 2025Updated 7 months ago
- VLA model interpretability tools☆30Feb 24, 2026Updated last week
- Code for P3PO☆21Jan 31, 2025Updated last year
- Official repository for "RLVR-World: Training World Models with Reinforcement Learning" (NeurIPS 2025), https://arxiv.org/abs/2505.13934☆223Oct 28, 2025Updated 4 months ago
- AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation☆31Feb 23, 2026Updated last week
- Official Implementation of VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Jo…☆23Jun 27, 2025Updated 8 months ago
- ☆201Oct 22, 2025Updated 4 months ago
- [ICLR 2026] Codebase for paper "Geometry-aware 4D Video Generation for Robot Manipulation"☆88Jan 10, 2026Updated last month
- Code☆43Updated this week
- ☆10Jun 11, 2025Updated 8 months ago
- MimicDroid: In-Context Learning for Humanoid Robot Manipulation from Human Play Videos☆45Feb 10, 2026Updated 3 weeks ago
- This is the project page of ShowRoom3D☆26Dec 22, 2023Updated 2 years ago
- [ICLR’26] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control☆96Feb 8, 2026Updated 3 weeks ago
- Scalable Minecraft multiplayer data collection engine☆70Updated this week
- ☆53Updated this week
- Official Implementation of Paper [DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation]☆73Dec 29, 2025Updated 2 months ago
- ☆185Oct 9, 2025Updated 4 months ago
- FloVD official pytorch codes☆47May 13, 2025Updated 9 months ago
- 🦾 A Dual-System VLA with System2 Thinking☆133Aug 21, 2025Updated 6 months ago
- ☆48Jul 4, 2025Updated 8 months ago
- [ICCV 2025] InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models☆115Jan 23, 2026Updated last month
- [CVPR 2026] MotionCrafter: Dense Geometry and Motion Reconstruction with a 4D VAE☆113Updated this week
- Official Repository of “MotionTrans: Human VR Data Enable Motion-Level Learning for Robotic Manipulation Policies"☆56Sep 25, 2025Updated 5 months ago
- ☆28Aug 6, 2025Updated 6 months ago
- StableWorld: Towards Stable and Consistent Long Interactive Video Generation☆81Feb 3, 2026Updated last month
- Evo-0: Vision-Language-Action Model with Implicit Spatial Understanding.☆53Nov 24, 2025Updated 3 months ago
- [ICCV2025] BézierGS: Dynamic Urban Scene Reconstruction with Bézier Curve Gaussian Splatting☆127Sep 3, 2025Updated 6 months ago