PKU-YuanGroup / Open-Sora-PlanLinks
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
☆11,998Updated last week
Alternatives and similar repositories for Open-Sora-Plan
Users that are interested in Open-Sora-Plan are comparing it to the libraries listed below
Sorting:
- Open-Sora: Democratizing Efficient Video Production for All☆26,885Updated 2 months ago
- Large World Model -- Modeling Text and Video with Millions Context☆7,306Updated 9 months ago
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper☆6,308Updated 9 months ago
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"☆3,296Updated last year
- MiniSora: A community aims to explore the implementation path and future development direction of Sora.☆1,264Updated 5 months ago
- [NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Mod…☆8,313Updated 2 months ago
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"☆7,559Updated last year
- Mora: More like Sora for Generalist Video Generation☆1,565Updated 9 months ago
- Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding☆4,201Updated 6 months ago
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.☆1,850Updated 3 months ago
- VideoSys: An easy and efficient system for video generation☆1,986Updated 4 months ago
- Official Code for Stable Cascade☆6,594Updated 11 months ago
- A curated list of recent diffusion models for video generation, editing, and various other applications.☆4,666Updated last week
- TripoSR: Fast 3D Object Reconstruction from a Single Image☆5,560Updated 11 months ago
- InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥☆11,715Updated last year
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models☆3,115Updated 6 months ago
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,130Updated 8 months ago
- [WIP] Layer Diffusion for WebUI (via Forge)☆4,074Updated 10 months ago
- PhotoMaker [CVPR 2024]☆10,022Updated 8 months ago
- MiniCPM4: Ultra-Efficient LLMs on End Devices, achieving 5+ speedup on typical end-side chips☆8,084Updated last week
- VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models☆4,905Updated last year
- Official implementations for paper: Anydoor: zero-shot object-level image customization☆4,170Updated last year
- Character Animation (AnimateAnyone, Face Reenactment)☆3,417Updated last year
- official repository of aiXcoder-7B Code Large Language Model☆2,270Updated last week
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆4,978Updated last year
- [CSUR] A Survey on Video Diffusion Models☆2,159Updated 3 weeks ago
- [CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text☆1,583Updated 3 months ago
- 【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection☆3,306Updated 7 months ago
- Your image is almost there!☆7,647Updated 11 months ago
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,204Updated 5 months ago