lucidrains / lumiere-pytorch
Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch
☆278Updated last year
Alternatives and similar repositories for lumiere-pytorch
Users who are interested in lumiere-pytorch are comparing it to the libraries listed below.
- Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch☆446Updated 8 months ago
- [ECCV 2024, Oral] FMBoost: Boosting Latent Diffusion with Flow Matching☆244Updated 9 months ago
- Implementation of Key-Locked Rank One Editing, from Nvidia AI☆235Updated 2 years ago
- Code for instruction-tuning Stable Diffusion.☆240Updated last year
- Unofficial PyTorch implementation of the VideoLDM.☆158Updated 2 years ago
- [ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model☆425Updated 10 months ago
- Implementation of a framework for Genie2 in Pytorch☆151Updated 8 months ago
- PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.☆143Updated last year
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆426Updated 2 years ago
- Text to Image Latent Diffusion using a Transformer core☆208Updated last year
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)☆526Updated last week
- ☆86Updated last year
- Open reproduction of MUSE for fast text2image generation.☆357Updated last year
- Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models arXiv 2023 / CVPR 2024☆348Updated 11 months ago
- Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"☆178Updated last year
- [ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation☆498Updated 10 months ago
- Official Pytorch Implementation of DenseDiffusion (ICCV 2023)☆498Updated last year
- LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation☆492Updated 10 months ago
- [NeurIPS 2024] Official implementation of "Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models"☆341Updated 6 months ago
- Train VAE like a boss☆292Updated 10 months ago
- LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusi…☆476Updated last year
- AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆299Updated 10 months ago
- Official Implementation for "Cross-Image Attention for Zero-Shot Appearance Transfer"☆384Updated last year
- [ICCV 2023] Efficient Diffusion Training via Min-SNR Weighting Strategy☆260Updated 9 months ago
- Inference-time scaling of diffusion-based image and video generation models.☆168Updated 2 months ago
- Official Implementation of weights2weights☆148Updated 6 months ago
- Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"☆305Updated 8 months ago
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!☆406Updated 6 months ago
- ☆193Updated last year
- Code for Fast Training of Diffusion Models with Masked Transformers☆411Updated last year