facebookresearch / DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
☆7,352 Updated last year
Alternatives and similar repositories for DiT
Users who are interested in DiT compare it to the repositories listed below.
- Denoising Diffusion Probabilistic Models ☆4,429 Updated last year
- [NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling:… ☆8,068 Updated 2 weeks ago
- ☆6,794 Updated 11 months ago
- Release for Improved Denoising Diffusion Probabilistic Models ☆3,557 Updated 10 months ago
- Hackable and optimized Transformers building blocks, supporting a composable construction. ☆9,527 Updated this week
- Implementation of Denoising Diffusion Probabilistic Model in Pytorch ☆9,395 Updated 7 months ago
- Diffusion model papers, survey, and taxonomy ☆3,193 Updated 3 months ago
- An open source implementation of CLIP. ☆11,853 Updated this week
- Vector (and Scalar) Quantization, in Pytorch ☆3,269 Updated 3 weeks ago
- Consistency Distilled Diff VAE ☆2,187 Updated last year
- A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models". ☆1,008 Updated 2 years ago
- Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (NeurIPS 2022 Oral) ☆1,701 Updated last year
- [CSUR] A Survey on Video Diffusion Models ☆2,100 Updated this week
- ☆1,819 Updated 11 months ago
- A curated list of recent diffusion models for video generation, editing, and various other applications. ☆4,456 Updated last week
- Official repo for consistency models. ☆6,347 Updated last year
- Efficient vision foundation models for high-resolution generation and perception. ☆2,893 Updated last month
- Denoising Diffusion Implicit Models ☆1,636 Updated 10 months ago
- PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838 ☆1,576 Updated 8 months ago
- Taming Transformers for High-Resolution Image Synthesis ☆6,185 Updated 10 months ago
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation ☆1,761 Updated 9 months ago
- Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more. ☆2,900 Updated last week
- Mamba SSM architecture ☆14,982 Updated last week
- VideoSys: An easy and efficient system for video generation ☆1,967 Updated 2 months ago
- PyTorch code and models for V-JEPA self-supervised learning from video. ☆3,005 Updated 3 months ago
- LAVIS - A One-stop Library for Language-Vision Intelligence ☆10,586 Updated 6 months ago
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion" ☆1,420 Updated 2 years ago
- This may be the simplest implementation of DDPM. You can directly run Main.py to train the UNet on CIFAR-10 dataset and see the amazing proces… ☆1,887 Updated 2 years ago
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation. ☆1,826 Updated last month
- A concise but complete full-attention transformer with a set of promising experimental features from various papers ☆5,340 Updated this week