feizc / DiT-MoE
Scaling Diffusion Transformers with Mixture of Experts
☆178Updated last week
Related projects: ⓘ
- ☆168Updated 2 months ago
- Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"☆175Updated this week
- SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training☆148Updated 2 months ago
- CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆210Updated 2 weeks ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆107Updated 3 months ago
- The official implementation of Latte: Latent Diffusion Transformer for Video Generation.☆32Updated 6 months ago
- ☆99Updated 6 months ago
- Transformer-Mamba Diffusion Models☆78Updated 2 months ago
- Official implementation of "Controlling Text-to-Image Diffusion by Orthogonal Finetuning".☆279Updated 9 months ago
- MoVQGAN - model for the image encoding and reconstruction☆115Updated 10 months ago
- An in-context conditioning version of MUSE with pre-trained checkpoints.☆105Updated last year
- Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch☆236Updated 3 weeks ago
- [ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model☆357Updated 6 months ago
- [ICLR2024] The official implementation of paper "VDT: General-purpose Video Diffusion Transformers via Mask Modeling", by Haoyu Lu, Guoxi…☆205Updated 4 months ago
- Official PyTorch Implementation of "Scalable Autoregressive Image Generation with Mamba"☆95Updated 3 weeks ago
- ☆89Updated 2 months ago
- Implementation of "DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning"☆75Updated last year
- My Implementation of Adversarial Diffusion Distillation https://arxiv.org/pdf/2311.17042.pdf☆38Updated 4 months ago
- ☆93Updated 2 months ago
- Unofficial PyTorch implementation of the VideoLDM.☆144Updated last year
- Official implementation of FouriScale (ECCV2024)☆131Updated last month
- Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models arXiv 2023 / CVPR 2024☆299Updated 5 months ago
- VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models☆93Updated last month
- Implements VAR+CLIP for image generation☆64Updated last month
- This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation☆394Updated last week
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision☆107Updated 2 months ago
- Code for "Diffusion Model Alignment Using Direct Preference Optimization"☆229Updated 8 months ago
- Scalable Diffusion Models with State Space Backbone☆146Updated 6 months ago
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models☆118Updated 2 weeks ago
- [ICCV 2023] Efficient Diffusion Training via Min-SNR Weighting Strategy☆214Updated 5 months ago