PKU-YuanGroup / Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model). We hope the open-source community will contribute to it.
☆11,988Updated last week
Alternatives and similar repositories for Open-Sora-Plan
Users who are interested in Open-Sora-Plan are comparing it to the libraries listed below.
- Open-Sora: Democratizing Efficient Video Production for All☆26,748Updated 2 months ago
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"☆3,288Updated last year
- Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding☆4,174Updated 5 months ago
- [NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Mod…☆8,282Updated last month
- Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference☆4,527Updated last year
- [ECCV 2024] Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance☆4,213Updated 11 months ago
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"☆7,474Updated last year
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper☆6,307Updated 9 months ago
- Official implementation of AnimateDiff.☆11,536Updated 11 months ago
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆2,886Updated 9 months ago
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance☆8,434Updated last month
- Kolors Team☆4,467Updated 7 months ago
- MiniSora: A community that aims to explore the implementation path and future development direction of Sora.☆1,264Updated 4 months ago
- Character Animation (AnimateAnyone, Face Reenactment)☆3,403Updated last year
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.☆1,838Updated 2 months ago
- [WIP] Layer Diffusion for WebUI (via Forge)☆4,068Updated 10 months ago
- MiniCPM4: Ultra-Efficient LLMs on End Devices, achieving 5+ speedup on typical end-side chips☆8,023Updated last week
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆4,965Updated 11 months ago
- StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation☆10,254Updated 6 months ago
- VideoSys: An easy and efficient system for video generation☆1,980Updated 3 months ago
- Unofficial Implementation of Animate Anyone☆2,931Updated 11 months ago
- Official implementations for paper: Anydoor: zero-shot object-level image customization☆4,165Updated last year
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆11,610Updated last week
- Large World Model -- Modeling Text and Video with Millions Context☆7,302Updated 8 months ago
- Official Code for Stable Cascade☆6,588Updated 11 months ago
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising☆2,732Updated last year
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.☆6,067Updated last year
- InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥☆11,685Updated 11 months ago
- a state-of-the-art-level open visual language model | multimodal pre-trained model☆6,596Updated last year
- Latest Advances on Multimodal Large Language Models☆15,642Updated this week