mini-sora/minisora

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mini-sora/minisora)

mini-sora / minisora

MiniSora: A community aims to explore the implementation path and future development direction of Sora.

☆1,282

Alternatives and similar repositories for minisora

Users that are interested in minisora are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

NUS-HPC-AI-Lab / VideoSys
View on GitHub
VideoSys: An easy and efficient system for video generation
☆2,025Aug 27, 2025Updated 8 months ago
Vchitect / Latte
View on GitHub
[TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.
☆1,938Oct 30, 2025Updated 6 months ago
mini-sora / MiniSora-DiT
View on GitHub
minisora-DiT, a DiT reproduction based on XTuner from the open source community MiniSora
☆39Mar 25, 2024Updated 2 years ago
PKU-YuanGroup / Open-Sora-Plan
View on GitHub
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
☆12,161Mar 8, 2026Updated 2 months ago
facebookresearch / DiT
View on GitHub
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
☆8,579May 31, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Alpha-VLLM / Lumina-T2X
View on GitHub
Lumina-T2X is a unified framework for Text to Any Modality Generation
☆2,252Feb 16, 2025Updated last year
PixArt-alpha / PixArt-alpha
View on GitHub
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
☆3,299Oct 31, 2024Updated last year
ChenHsing / Awesome-Video-Diffusion-Models
View on GitHub
[CSUR] A Survey on Video Diffusion Models
☆2,290Apr 15, 2026Updated last month
hpcaitech / Open-Sora
View on GitHub
Open-Sora: Democratizing Efficient Video Production for All
☆29,002Apr 9, 2026Updated last month
FoundationVision / LlamaGen
View on GitHub
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
☆1,948Aug 15, 2024Updated last year
snap-research / Panda-70M
View on GitHub
[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
☆688Oct 25, 2024Updated last year
showlab / Awesome-Video-Diffusion
View on GitHub
A curated list of recent diffusion models for video generation, editing, and various other applications.
☆5,644May 8, 2026Updated 2 weeks ago
Tencent-Hunyuan / HunyuanDiT
View on GitHub
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
☆4,297Nov 27, 2025Updated 5 months ago
FoundationVision / VAR
View on GitHub
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Mod…
☆8,686Nov 10, 2025Updated 6 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
TencentARC / SEED-Voken
View on GitHub
SEED-Voken: A Series of Powerful Visual Tokenizers
☆1,008Nov 25, 2025Updated 5 months ago
baaivision / Emu3
View on GitHub
Next-Token Prediction is All You Need
☆2,409Jan 12, 2026Updated 4 months ago
willisma / SiT
View on GitHub
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
☆1,166Dec 22, 2025Updated 5 months ago
Meituan-AutoML / VisionLLaMA
View on GitHub
VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks
☆392Jul 9, 2024Updated last year
guoqincode / Open-AnimateAnyone
View on GitHub
Unofficial Implementation of Animate Anyone
☆2,928Jul 9, 2024Updated last year
luosiallen / latent-consistency-model
View on GitHub
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
☆4,619Jun 14, 2024Updated last year
ali-vilab / VGen
View on GitHub
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
☆3,154Jan 10, 2025Updated last year
Vchitect / LaVie
View on GitHub
[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models
☆953Nov 13, 2024Updated last year
openai / consistencydecoder
View on GitHub
Consistency Distilled Diff VAE
☆2,214Nov 7, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
tencent-ailab / IP-Adapter
View on GitHub
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
☆6,567Jun 28, 2024Updated last year
AILab-CVC / CV-VAE
View on GitHub
[NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models
☆285Dec 4, 2024Updated last year
Doubiiu / DynamiCrafter
View on GitHub
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
☆3,004Sep 8, 2024Updated last year
baofff / U-ViT
View on GitHub
A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".
☆1,107Mar 25, 2023Updated 3 years ago
AILab-CVC / VideoCrafter
View on GitHub
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
☆5,055Jan 9, 2026Updated 4 months ago
mira-space / Mira
View on GitHub
☆360Oct 21, 2024Updated last year
MooreThreads / Moore-AnimateAnyone
View on GitHub
Character Animation (AnimateAnyone, Face Reenactment)
☆3,502May 31, 2024Updated last year
baaivision / NOVA
View on GitHub
[ICLR 2025] Autoregressive Video Generation without Vector Quantization
☆648Oct 29, 2025Updated 6 months ago
alibaba / animate-anything
View on GitHub
Fine-Grained Open Domain Image Animation with Motion Guidance
☆964Oct 18, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
hao-ai-lab / FastVideo
View on GitHub
A unified inference and post-training framework for accelerated video generation.
☆3,504Updated this week
showlab / Show-o
View on GitHub
[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.
☆1,930Jan 8, 2026Updated 4 months ago
mira-space / MiraData
View on GitHub
Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"
☆523Sep 2, 2024Updated last year
pixeli99 / SVD_Xtend
View on GitHub
Stable Video Diffusion Training Code and Extensions.
☆733Jul 25, 2024Updated last year
lllyasviel / LayerDiffuse
View on GitHub
Transparent Image Layer Diffusion using Latent Transparency
☆2,205Jun 16, 2024Updated last year
Picsart-AI-Research / StreamingT2V
View on GitHub
[CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
☆1,631Mar 27, 2025Updated last year
aigc-apps / EasyAnimate
View on GitHub
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
☆2,262Mar 6, 2025Updated last year