PKU-YuanGroup / Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
☆11,887Updated 3 weeks ago
Alternatives and similar repositories for Open-Sora-Plan:
Users who are interested in Open-Sora-Plan are comparing it to the libraries listed below.
- Open-Sora: Democratizing Efficient Video Production for All☆23,326Updated this week
- [NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling:…☆6,560Updated last month
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper☆6,177Updated 4 months ago
- Large World Model -- Modeling Text and Video with Millions Context☆7,224Updated 3 months ago
- MiniSora: A community that aims to explore the implementation path and future development direction of Sora.☆1,256Updated last month
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance☆7,017Updated last month
- Mora: More like Sora for Generalist Video Generation☆1,546Updated 4 months ago
- FaceChain is a deep-learning toolchain for generating your Digital-Twin.☆9,277Updated 2 months ago
- Your image is almost there!☆7,487Updated 6 months ago
- Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on☆6,043Updated 9 months ago
- Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding☆3,895Updated last month
- InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥☆11,405Updated 7 months ago
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"☆3,240Updated 9 months ago
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"☆6,824Updated 8 months ago
- Character Animation (AnimateAnyone, Face Reenactment)☆3,313Updated 8 months ago
- Latte: Latent Diffusion Transformer for Video Generation.☆1,773Updated 3 weeks ago
- [WIP] Layer Diffusion for WebUI (via Forge)☆3,958Updated 5 months ago
- Official implementations for paper: Anydoor: zero-shot object-level image customization☆4,085Updated 10 months ago
- [CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model☆10,684Updated 7 months ago
- DeepSeek-VL: Towards Real-World Vision-Language Understanding☆3,445Updated 9 months ago
- VideoSys: An easy and efficient system for video generation☆1,917Updated last month
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆21,444Updated 6 months ago
- Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance☆4,157Updated 7 months ago
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising☆2,600Updated 7 months ago
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.☆67,379Updated this week
- Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆10,677Updated 3 weeks ago
- Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>☆4,515Updated 7 months ago
- Enjoy the magic of Diffusion models!☆6,817Updated this week
- Official implementation of AnimateDiff.☆10,998Updated 6 months ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆4,833Updated 7 months ago