mini-sora/MiniSora-DiT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mini-sora/MiniSora-DiT)

mini-sora / MiniSora-DiT

minisora-DiT, a DiT reproduction based on XTuner from the open source community MiniSora

☆39

Alternatives and similar repositories for MiniSora-DiT

Users that are interested in MiniSora-DiT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mini-sora / minisora
View on GitHub
MiniSora: A community aims to explore the implementation path and future development direction of Sora.
☆1,282Feb 18, 2025Updated last year
apple / ml-sid-dit
View on GitHub
☆46Oct 29, 2025Updated 6 months ago
randxie / instruct-nerf2nerf-gaussian-splatting
View on GitHub
☆27Sep 20, 2023Updated 2 years ago
narugo1992 / gchar
View on GitHub
Crawler and cleaner of data for novelai embedding's training
☆21May 22, 2025Updated last year
HINTLab / MELA-Challenge
View on GitHub
MICCAI 2022 MELA Challenge: Mediastinal Lesion Analysis (3D Detection)
☆11Jun 30, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
NovelAI / k-diffusion-multigen
View on GitHub
Karras et al. (2022) diffusion models for PyTorch
☆17Oct 5, 2023Updated 2 years ago
lucasjinreal / ImageTokenizer
View on GitHub
imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video…
☆40Jun 22, 2024Updated last year
DSaurus / threestudio-4dfy
View on GitHub
☆45Jan 12, 2024Updated 2 years ago
dongzhuoyao / uspace
View on GitHub
An official pytorch implementation of AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching"
☆51Apr 10, 2024Updated 2 years ago
sfanxiang / videoshop
View on GitHub
Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion
☆46Aug 1, 2024Updated last year
InternLM / InternEvo_bak
View on GitHub
InternEvo is a high-performance training system for giant models.
☆38Jan 17, 2024Updated 2 years ago
InternLM / .github
View on GitHub
☆36Sep 21, 2025Updated 8 months ago
lucasjinreal / LLaVA-Magvit2
View on GitHub
LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.
☆39Jun 20, 2024Updated last year
apluka34 / magic-animate-modified
View on GitHub
A modified version of origin Magic Animate (https://showlab.github.io/magicanimate/)
☆20Feb 27, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
oooolga / Ctrl-V
View on GitHub
👆Pytorch implementation of "Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion"
☆35Jul 28, 2025Updated 9 months ago
Vchitect / Latte
View on GitHub
[TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.
☆1,938Oct 30, 2025Updated 6 months ago
rtsao / css-in-js-perf-tests
View on GitHub
CSS-in-JS performance tests
☆10Jan 4, 2017Updated 9 years ago
FoundationVision / vaex
View on GitHub
🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook
☆107Jun 23, 2024Updated last year
snap-research / Panda-70M
View on GitHub
[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
☆688Oct 25, 2024Updated last year
InternLM / InternLMWeb
View on GitHub
☆49Jul 12, 2023Updated 2 years ago
360CVGroup / Bridge_Diffusion_Model
View on GitHub
Chinese-native image generation while compatible with SD eco-system, 1st-gen, AAAI2025
☆13Jun 25, 2024Updated last year
Mombin / speech2vid
View on GitHub
☆15Oct 28, 2019Updated 6 years ago
guanxiongsun / STPN
View on GitHub
[ICCV2023] Spatio-temporal Prompting Network for Robust Video Feature Extraction
☆10Aug 17, 2023Updated 2 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
00make / robodog
View on GitHub
A Python library for controlling AlphaDog robotic dogs.
☆12Apr 16, 2026Updated last month
tinnerhrhe / GARDO
View on GitHub
Official codes for the paper "GARDO: Reinforcing Diffusion Models without Reward Hacking"
☆58May 3, 2026Updated 3 weeks ago
Vchitect / LaVie
View on GitHub
[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models
☆953Nov 13, 2024Updated last year
mattiasxu / Video-VQVAE
View on GitHub
VQVAE for video prediction
☆31Apr 22, 2022Updated 4 years ago
deepghs / hfutils
View on GitHub
Useful utilities for huggingface
☆25Dec 26, 2025Updated 4 months ago
xhan77 / jpeg-lm
View on GitHub
JPEG-LM: LLMs as Image Generators with Canonical Codec Representations
☆15Sep 29, 2024Updated last year
TashanTec / TactiSim
View on GitHub
☆30Jun 10, 2025Updated 11 months ago
Ugness / MeBT
View on GitHub
official implementation of the paper: Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transform…
☆28May 18, 2023Updated 3 years ago
legovaer / my-awesome-stars
View on GitHub
A curated list of my GitHub stars
☆15Mar 14, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
songweige / TATS
View on GitHub
Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV …
☆287May 1, 2024Updated 2 years ago
jy0205 / LaVIT
View on GitHub
LaVIT: Empower the Large Language Model to Understand and Generate Visual Content
☆604Oct 6, 2024Updated last year
cuixiaorui / mini-vite
View on GitHub
实现 mini vite ，学习 vite 原理
☆13Sep 19, 2021Updated 4 years ago
mendrik / diorama-2023
View on GitHub
An image layout algorithm for fixed sized canvases.
☆29Feb 16, 2026Updated 3 months ago
wangyouze / Trust-videoLLMs
View on GitHub
☆32Nov 25, 2025Updated 5 months ago
feizc / Stable-Edit
View on GitHub
Text-based real image editing with stable diffusion models
☆27Dec 19, 2022Updated 3 years ago
yataoz / face_reenact_GDPW
View on GitHub
Code repository for the BMVC 2022 paper: Geometry Driven Progressive Warping for One Shot Face Animation
☆12Jan 6, 2023Updated 3 years ago