Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
☆8,433May 31, 2024Updated last year
Alternatives and similar repositories for DiT
Users that are interested in DiT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"☆1,129Dec 22, 2025Updated 3 months ago
- High-Resolution Image Synthesis with Latent Diffusion Models☆13,924Feb 29, 2024Updated 2 years ago
- [NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Mod…☆8,646Nov 10, 2025Updated 4 months ago
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,281Oct 31, 2024Updated last year
- ☆7,319Jul 2, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.☆1,929Oct 30, 2025Updated 4 months ago
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.☆33,085Mar 18, 2026Updated last week
- PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838☆1,879Feb 20, 2026Updated last month
- A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".☆1,099Mar 25, 2023Updated 3 years ago
- VideoSys: An easy and efficient system for video generation☆2,020Aug 27, 2025Updated 6 months ago
- A curated list of recent diffusion models for video generation, editing, and various other applications.☆5,538Mar 14, 2026Updated last week
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,941Aug 15, 2024Updated last year
- Let us control diffusion models!☆33,752Feb 25, 2024Updated 2 years ago
- This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.☆12,144Mar 8, 2026Updated 2 weeks ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Open-Sora: Democratizing Efficient Video Production for All☆28,728Apr 30, 2025Updated 10 months ago
- Official repo for consistency models.☆6,474Mar 22, 2024Updated 2 years ago
- [ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think☆1,585Mar 16, 2025Updated last year
- A collection of resources and papers on Diffusion Models☆12,297Aug 1, 2024Updated last year
- Official Implementation of Rectified Flow (ICLR2023 Spotlight)☆1,563Jul 20, 2024Updated last year
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆24,603Aug 12, 2024Updated last year
- Release for Improved Denoising Diffusion Probabilistic Models☆3,811Jul 18, 2024Updated last year
- Fast and memory-efficient exact attention☆22,938Updated this week
- Elucidating the Design Space of Diffusion-Based Generative Models (EDM)☆1,921Mar 16, 2024Updated 2 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Taming Transformers for High-Resolution Image Synthesis☆6,455Jul 30, 2024Updated last year
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image☆32,861Feb 18, 2026Updated last month
- Generative Models by Stability AI☆27,024Dec 16, 2025Updated 3 months ago
- Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference☆4,613Jun 14, 2024Updated last year
- An open source implementation of CLIP.☆13,528Mar 12, 2026Updated last week
- Denoising Diffusion Probabilistic Models☆5,097Aug 29, 2023Updated 2 years ago
- A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes…☆4,246Jan 5, 2026Updated 2 months ago
- PyTorch code and models for the DINOv2 self-supervised learning method.☆12,553Mar 12, 2026Updated last week
- LAVIS - A One-stop Library for Language-Vision Intelligence☆11,194Nov 18, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,253Feb 16, 2025Updated last year
- Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (Neurips 2022 Oral)☆1,826Feb 6, 2024Updated 2 years ago
- Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)☆594Apr 23, 2024Updated last year
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆12,532Nov 4, 2025Updated 4 months ago
- Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding☆4,298Nov 27, 2025Updated 3 months ago
- ☆3,444May 14, 2024Updated last year
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.☆6,502Jun 28, 2024Updated last year