PKU-YuanGroup / Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model). We hope the open-source community will contribute to this project.
☆12,022Updated last week
Alternatives and similar repositories for Open-Sora-Plan
Users who are interested in Open-Sora-Plan are comparing it to the libraries listed below.
- Open-Sora: Democratizing Efficient Video Production for All☆27,251Updated 5 months ago
- Large World Model -- Modeling Text and Video with Millions Context☆7,348Updated 11 months ago
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"☆3,320Updated last year
- [NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Mod…☆8,417Updated 4 months ago
- MiniSora: A community that aims to explore the implementation path and future development direction of Sora.☆1,264Updated 7 months ago
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"☆7,870Updated last year
- Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding☆4,250Updated 8 months ago
- [AAAI 2025] Official implementation of "OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on"☆6,431Updated last year
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper☆6,350Updated last year
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.☆1,878Updated 5 months ago
- Mora: More like Sora for Generalist Video Generation☆1,571Updated 11 months ago
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,197Updated 11 months ago
- Character Animation (AnimateAnyone, Face Reenactment)☆3,442Updated last year
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆11,972Updated last month
- Unofficial Implementation of Animate Anyone☆2,937Updated last year
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆5,003Updated last year
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆2,947Updated last year
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models☆3,134Updated 8 months ago
- Official implementation of AnimateDiff.☆11,770Updated last year
- VideoSys: An easy and efficient system for video generation☆2,004Updated last month
- a state-of-the-art-level open visual language model | multimodal pre-trained model☆6,668Updated last year
- 【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection☆3,365Updated 10 months ago
- FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generate using MULTI LLMs, without GPU neede…☆8,720Updated last year
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance☆9,264Updated 2 weeks ago
- Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions☆7,651Updated last year
- [CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text☆1,598Updated 6 months ago
- Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation☆14,744Updated 2 weeks ago
- Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>☆4,772Updated 7 months ago
- [CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"☆10,851Updated last month
- VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models☆4,959Updated last year