ByteDance-Seed/VideoWorld

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ByteDance-Seed/VideoWorld)

ByteDance-Seed / VideoWorld

[CVPR 2025] VideoWorld is a simple generative model that learns purely from unlabeled videos—much like how babies learn by observing their environment.

☆792

Alternatives and similar repositories for VideoWorld

Users that are interested in VideoWorld are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ZiyuGuo99 / Image-Generation-CoT
View on GitHub
[CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation
☆865Mar 19, 2026Updated 3 months ago
showlab / Show-o
View on GitHub
[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.
☆1,958Jan 8, 2026Updated 5 months ago
baaivision / See3D
View on GitHub
[CVPR'25 Highlight] You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale
☆719Apr 16, 2025Updated last year
KlingAIResearch / GameFactory
View on GitHub
[ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos
☆491Mar 22, 2025Updated last year
nv-tlabs / GEN3C
View on GitHub
[CVPR 2025 Highlight] GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control
☆1,371Jun 15, 2026Updated 2 weeks ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
KlingAIResearch / SynCamMaster
View on GitHub
[ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints
☆691May 23, 2025Updated last year
ShuangLI59 / unified_video_action
View on GitHub
Official PyTorch Implementation of Unified Video Action Model (RSS 2025)
☆391Jul 23, 2025Updated 11 months ago
ML-GSAI / FlexWorld
View on GitHub
Official PyTorch implementation for "FlexWorld: Progressively Expanding 3D Scenes for Flexiable-View Synthesis".
☆132Sep 11, 2025Updated 9 months ago
FoundationVision / LlamaGen
View on GitHub
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
☆1,960Aug 15, 2024Updated last year
THU-SI / VideoScene
View on GitHub
[CVPR 2025 Highlight] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step
☆353Jul 4, 2025Updated last year
baaivision / NOVA
View on GitHub
[ICLR 2025] Autoregressive Video Generation without Vector Quantization
☆652Oct 29, 2025Updated 8 months ago
buoyancy99 / diffusion-forcing
View on GitHub
code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
☆1,266Nov 9, 2025Updated 7 months ago
wdrink / SimpleAR
View on GitHub
Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"
☆430Jun 20, 2025Updated last year
xizaoqu / WorldMem
View on GitHub
[NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory
☆372Feb 21, 2026Updated 4 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
bytedance / 1d-tokenizer
View on GitHub
This repo contains the code for 1D tokenizer and generator
☆1,165Mar 20, 2025Updated last year
InternRobotics / Aether
View on GitHub
[ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling
☆598Oct 26, 2025Updated 8 months ago
sihyun-yu / REPA
View on GitHub
[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
☆1,661Mar 16, 2025Updated last year
SandAI-org / MAGI-1
View on GitHub
MAGI-1: Autoregressive Video Generation at Scale
☆3,726Jun 17, 2026Updated 2 weeks ago
Jiawei-Yang / DeTok
View on GitHub
Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"
☆194Feb 24, 2026Updated 4 months ago
tianweiy / CausVid
View on GitHub
(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
☆1,376Aug 7, 2025Updated 10 months ago
hao-ai-lab / FastVideo
View on GitHub
A unified inference and post-training framework for accelerated video generation.
☆3,768Jun 25, 2026Updated last week
baaivision / Emu3
View on GitHub
Next-Token Prediction is All You Need
☆2,423Jan 12, 2026Updated 5 months ago
THU-SI / LangScene-X
View on GitHub
[ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion
☆303Jul 15, 2025Updated 11 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
showlab / FAR
View on GitHub
Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"
☆308Apr 23, 2025Updated last year
VITA-Group / Diffusion4D
View on GitHub
[NeurIPS 2024] Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models
☆343Jan 21, 2025Updated last year
KlingAIResearch / ReCamMaster
View on GitHub
[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
☆1,827Nov 28, 2025Updated 7 months ago
Drexubery / ViewCrafter
View on GitHub
[TPAMI 2025] ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis
☆1,567Dec 13, 2025Updated 6 months ago
FoundationVision / FlashVideo
View on GitHub
[AAAI-2026]FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
☆460Mar 5, 2025Updated last year
IGL-HKUST / DiffusionAsShader
View on GitHub
[SIGGRAPH 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control
☆822Jun 9, 2025Updated last year
Wan-Video / Wan2.1
View on GitHub
Wan: Open and Advanced Large-Scale Video Generative Models
☆16,325Mar 5, 2026Updated 3 months ago
ant-research / DepthLab
View on GitHub
Official implementation of "DepthLab: From Partial to Complete"
☆551Feb 14, 2025Updated last year
ByteDance-Seed / Bagel
View on GitHub
Open-source unified multimodal model
☆6,044May 4, 2026Updated 2 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
yifan123 / flow_grpo
View on GitHub
[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
☆2,369May 7, 2026Updated last month
Saiyan-World / goku
View on GitHub
[CVPR2025 Highlight] Video Generation Foundation Models: https://saiyan-world.github.io/goku/
☆2,912Feb 19, 2025Updated last year
FoundationVision / UniTok
View on GitHub
[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding
☆527Nov 14, 2025Updated 7 months ago
RenShuhuai-Andy / NBP
View on GitHub
Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling
☆42Feb 12, 2025Updated last year
Nut-World / NutWorld
View on GitHub
Seeing World Dynamics in a Nutshell
☆114Mar 18, 2025Updated last year
XDimLab / Prometheus
View on GitHub
[CVPR2025] Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation
☆147Jul 5, 2025Updated 11 months ago
SunYangtian / UniGeo
View on GitHub
UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation
☆136Jun 10, 2025Updated last year