PKU-YuanGroup / Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model). We hope the open-source community will contribute to this project.
☆11,968Updated last month
Alternatives and similar repositories for Open-Sora-Plan
Users that are interested in Open-Sora-Plan are comparing it to the libraries listed below
- Open-Sora: Democratizing Efficient Video Production for All☆26,530Updated last month
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"☆3,279Updated last year
- [NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling:…☆7,996Updated last week
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper☆6,293Updated 8 months ago
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.☆1,826Updated last month
- Mora: More like Sora for Generalist Video Generation☆1,559Updated 7 months ago
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.☆5,985Updated 11 months ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆4,947Updated 10 months ago
- Your image is almost there!☆7,614Updated 10 months ago
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance☆8,174Updated last month
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,096Updated 7 months ago
- a state-of-the-art-level open visual language model | multimodal pretrained model☆6,562Updated last year
- Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding☆4,130Updated 4 months ago
- Enjoy the magic of Diffusion models!☆8,713Updated last week
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"☆7,319Updated last year
- A series of large language models trained from scratch by developers @01-ai☆7,828Updated 6 months ago
- Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference☆4,514Updated 11 months ago
- MiniSora: A community that aims to explore the implementation path and future development direction of Sora.☆1,267Updated 3 months ago
- InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥☆11,630Updated 10 months ago
- The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.☆5,938Updated 9 months ago
- [SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation☆5,813Updated 2 months ago
- An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)☆4,568Updated this week
- VideoSys: An easy and efficient system for video generation☆1,967Updated 2 months ago
- [WIP] Layer Diffusion for WebUI (via Forge)☆4,053Updated 9 months ago
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥☆1,918Updated 8 months ago
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆11,476Updated 2 weeks ago
- Large World Model -- Modeling Text and Video with Millions Context☆7,277Updated 7 months ago
- FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generate using MULTI LLMs, without GPU neede…☆8,706Updated last year
- [TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators☆1,319Updated this week
- Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>☆4,659Updated 2 months ago