Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
☆8,382May 31, 2024Updated last year
Alternatives and similar repositories for DiT
Users that are interested in DiT are comparing it to the libraries listed below
Sorting:
- High-Resolution Image Synthesis with Latent Diffusion Models☆13,864Feb 29, 2024Updated 2 years ago
- [NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Mod…☆8,626Nov 10, 2025Updated 3 months ago
- Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"☆1,102Dec 22, 2025Updated 2 months ago
- ☆7,306Jul 2, 2024Updated last year
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,281Oct 31, 2024Updated last year
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.☆32,873Feb 26, 2026Updated last week
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.☆1,920Oct 30, 2025Updated 4 months ago
- PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838☆1,863Feb 20, 2026Updated last week
- VideoSys: An easy and efficient system for video generation☆2,016Aug 27, 2025Updated 6 months ago
- A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".☆1,097Mar 25, 2023Updated 2 years ago
- This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.☆12,134Oct 29, 2025Updated 4 months ago
- A curated list of recent diffusion models for video generation, editing, and various other applications.☆5,489Updated this week
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,936Aug 15, 2024Updated last year
- Let us control diffusion models!☆33,663Feb 25, 2024Updated 2 years ago
- A collection of resources and papers on Diffusion Models☆12,273Aug 1, 2024Updated last year
- Official repo for consistency models.☆6,477Mar 22, 2024Updated last year
- Open-Sora: Democratizing Efficient Video Production for All☆28,632Apr 30, 2025Updated 10 months ago
- Fast Diffusion Models with Transformers☆929Aug 17, 2025Updated 6 months ago
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆24,500Aug 12, 2024Updated last year
- Fast and memory-efficient exact attention☆22,460Updated this week
- [ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think☆1,560Mar 16, 2025Updated 11 months ago
- Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference☆4,615Jun 14, 2024Updated last year
- Release for Improved Denoising Diffusion Probabilistic Models☆3,799Jul 18, 2024Updated last year
- Generative Models by Stability AI☆26,943Dec 16, 2025Updated 2 months ago
- An open source implementation of CLIP.☆13,430Updated this week
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image☆32,642Feb 18, 2026Updated 2 weeks ago
- Taming Transformers for High-Resolution Image Synthesis☆6,438Jul 30, 2024Updated last year
- PyTorch code and models for the DINOv2 self-supervised learning method.☆12,427Feb 24, 2026Updated last week
- Official Implementation of Rectified Flow (ICLR2023 Spotlight)☆1,552Jul 20, 2024Updated last year
- Elucidating the Design Space of Diffusion-Based Generative Models (EDM)☆1,915Mar 16, 2024Updated last year
- LAVIS - A One-stop Library for Language-Vision Intelligence☆11,177Nov 18, 2024Updated last year
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,252Feb 16, 2025Updated last year
- A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes…☆4,167Jan 5, 2026Updated last month
- Denoising Diffusion Probabilistic Models☆5,054Aug 29, 2023Updated 2 years ago
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆12,449Nov 4, 2025Updated 4 months ago
- Mamba SSM architecture☆17,257Feb 18, 2026Updated 2 weeks ago
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.☆6,480Jun 28, 2024Updated last year
- Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (Neurips 2022 Oral)☆1,821Feb 6, 2024Updated 2 years ago
- ☆3,441May 14, 2024Updated last year