PKU-YuanGroup / Open-Sora-PlanLinks
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
☆11,988Updated 2 weeks ago
Alternatives and similar repositories for Open-Sora-Plan
Users that are interested in Open-Sora-Plan are comparing it to the libraries listed below
Sorting:
- Open-Sora: Democratizing Efficient Video Production for All☆26,691Updated last month
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper☆6,305Updated 8 months ago
- MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone☆19,629Updated last week
- [NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling:…☆8,239Updated last month
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"☆3,284Updated last year
- Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding☆4,168Updated 5 months ago
- Large World Model -- Modeling Text and Video with Millions Context☆7,293Updated 8 months ago
- [CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"☆10,742Updated 11 months ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆4,956Updated 11 months ago
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.☆18,509Updated this week
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models☆3,109Updated 5 months ago
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.☆1,837Updated 2 months ago
- InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥☆11,668Updated 11 months ago
- MiniCPM4: Ultra-Efficient LLMs on End Devices, achieving 5+ speedup on typical end-side chips☆7,951Updated last week
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,198Updated 4 months ago
- Your image is almost there!☆7,645Updated 10 months ago
- Mora: More like Sora for Generalist Video Generation☆1,561Updated 8 months ago
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"☆7,423Updated last year
- Unofficial Implementation of Animate Anyone☆2,930Updated 11 months ago
- Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions☆7,634Updated 10 months ago
- [TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators☆1,324Updated 3 weeks ago
- MiniSora: A community aims to explore the implementation path and future development direction of Sora.☆1,266Updated 4 months ago
- FaceChain is a deep-learning toolchain for generating your Digital-Twin.☆9,438Updated 2 weeks ago
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation☆8,495Updated 9 months ago
- [ECCV 2024] Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance☆4,208Updated 11 months ago
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.☆22,102Updated last week
- Enjoy the magic of Diffusion models!☆8,842Updated this week
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型☆8,356Updated 3 weeks ago
- GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型☆6,623Updated this week
- Kolors Team☆4,462Updated 7 months ago