PKU-YuanGroup / Open-Sora-PlanLinks
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
☆12,068Updated 2 weeks ago
Alternatives and similar repositories for Open-Sora-Plan
Users that are interested in Open-Sora-Plan are comparing it to the libraries listed below
Sorting:
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"☆3,326Updated last year
- Open-Sora: Democratizing Efficient Video Production for All☆27,837Updated 6 months ago
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper☆6,356Updated last year
- Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding☆4,266Updated 2 weeks ago
- Large World Model -- Modeling Text and Video with Millions Context☆7,365Updated last year
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models☆3,145Updated 10 months ago
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.☆1,883Updated 2 weeks ago
- [NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Mod…☆8,470Updated 5 months ago
- MiniSora: A community aims to explore the implementation path and future development direction of Sora.☆1,267Updated 8 months ago
- Your image is almost there!☆7,654Updated last year
- Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>☆4,797Updated 8 months ago
- Mora: More like Sora for Generalist Video Generation☆1,578Updated last year
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆5,010Updated last year
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,235Updated 8 months ago
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,220Updated last year
- VideoSys: An easy and efficient system for video generation☆2,005Updated 2 months ago
- Official implementations for paper: Anydoor: zero-shot object-level image customization☆4,187Updated last year
- [WIP] Layer Diffusion for WebUI (via Forge)☆4,098Updated last year
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"☆8,001Updated last year
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆2,963Updated last year
- Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference☆4,577Updated last year
- Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions☆7,645Updated last year
- 【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection☆3,393Updated 11 months ago
- Unofficial Implementation of Animate Anyone☆2,936Updated last year
- official repository of aiXcoder-7B Code Large Language Model☆2,275Updated 4 months ago
- InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥☆11,848Updated last year
- [ECCV 2024] Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance☆4,238Updated last year
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆12,109Updated last week
- Character Animation (AnimateAnyone, Face Reenactment)☆3,446Updated last year
- FaceChain is a deep-learning toolchain for generating your Digital-Twin.☆9,493Updated 5 months ago