PKU-YuanGroup / Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
☆11,950 Updated last month
Alternatives and similar repositories for Open-Sora-Plan:
Users who are interested in Open-Sora-Plan are comparing it to the libraries listed below:
- Open-Sora: Democratizing Efficient Video Production for All ☆26,355 Updated this week
- Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding ☆4,077 Updated 3 months ago
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper ☆6,281 Updated 7 months ago
- [ECCV 2024] Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance ☆4,197 Updated 9 months ago
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023) ☆11,329 Updated last month
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation. ☆1,820 Updated 3 weeks ago
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models" ☆3,270 Updated last year
- MiniSora: A community aims to explore the implementation path and future development direction of Sora. ☆1,266 Updated 2 months ago
- [NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling:… ☆7,731 Updated last month
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers" ☆7,224 Updated 11 months ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation ☆4,936 Updated 10 months ago
- Kolors Team ☆4,377 Updated 5 months ago
- FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generate using MULTI LLMs, without GPU neede… ☆8,683 Updated last year
- MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone ☆19,346 Updated 2 months ago
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation ☆8,410 Updated 7 months ago
- Official inference repo for FLUX.1 models ☆21,541 Updated 3 months ago
- VideoSys: An easy and efficient system for video generation ☆1,959 Updated last month
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. ☆22,354 Updated 8 months ago
- HunyuanVideo: A Systematic Framework For Large Video Generation Model ☆9,881 Updated last week
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising ☆2,690 Updated 10 months ago
- MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo. ☆7,334 Updated 6 months ago
- The best OSS video generation models ☆3,135 Updated 3 months ago
- Large World Model -- Modeling Text and Video with Millions Context ☆7,272 Updated 6 months ago
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models ☆3,105 Updated 3 months ago
- Mora: More like Sora for Generalist Video Generation ☆1,554 Updated 6 months ago
- Lumina-T2X is a unified framework for Text to Any Modality Generation ☆2,182 Updated 2 months ago
- A series of large language models trained from scratch by developers @01-ai ☆7,832 Updated 5 months ago
- [TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators ☆1,312 Updated 3 weeks ago
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis ☆3,070 Updated 6 months ago
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors ☆2,845 Updated 7 months ago