facebookresearch / DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
☆6,716 Updated 7 months ago
Alternatives and similar repositories for DiT:
Users who are interested in DiT compare it to the repositories listed below.
- [NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling:… ☆6,393 Updated this week
- Denoising Diffusion Probabilistic Models ☆4,063 Updated last year
- PyTorch code and models for V-JEPA self-supervised learning from video. ☆2,745 Updated 5 months ago
- A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc. ☆3,780 Updated this week
- Implementation of Denoising Diffusion Probabilistic Model in PyTorch ☆8,710 Updated 3 months ago
- An open source implementation of CLIP. ☆10,804 Updated last week
- Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (NeurIPS 2022 Oral) ☆1,599 Updated 11 months ago
- VideoSys: An easy and efficient system for video generation ☆1,875 Updated 2 weeks ago
- Efficient vision foundation models for high-resolution generation and perception. ☆2,554 Updated this week
- Release for Improved Denoising Diffusion Probabilistic Models ☆3,393 Updated 6 months ago
- Denoising Diffusion Implicit Models ☆1,531 Updated 5 months ago
- A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models". ☆957 Updated last year
- Hackable and optimized Transformers building blocks, supporting a composable construction. ☆8,910 Updated this week
- Consistency Distilled Diff VAE ☆2,148 Updated last year
- [CSUR] A Survey on Video Diffusion Models ☆1,906 Updated last month
- Latte: Latent Diffusion Transformer for Video Generation. ☆1,756 Updated 3 months ago
- Karras et al. (2022) diffusion models for PyTorch ☆2,373 Updated last week
- Vector (and Scalar) Quantization, in PyTorch ☆2,824 Updated last week
- LAVIS - A One-stop Library for Language-Vision Intelligence ☆10,161 Updated last month
- PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation ☆4,968 Updated 5 months ago
- MiniSora: A community that aims to explore the implementation path and future development direction of Sora. ☆1,245 Updated 3 weeks ago
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion" ☆1,394 Updated last year
- ☆1,784 Updated 6 months ago
- ☆6,469 Updated 6 months ago
- High-Resolution Image Synthesis with Latent Diffusion Models ☆12,225 Updated 10 months ago
- Official repo for consistency models. ☆6,232 Updated 9 months ago
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation ☆1,460 Updated 5 months ago
- Diffusion model papers, survey, and taxonomy ☆3,063 Updated last month
- Code for CRATE (Coding RAte reduction TransformEr). ☆1,201 Updated 2 months ago
- PyTorch code and models for the DINOv2 self-supervised learning method. ☆9,654 Updated 5 months ago