lucidrains / lumiere-pytorch
Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch
☆275Updated 10 months ago
Alternatives and similar repositories for lumiere-pytorch
Users that are interested in lumiere-pytorch are comparing it to the libraries listed below
- Unofficial PyTorch implementation of the VideoLDM.☆157Updated last year
- LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation☆482Updated 6 months ago
- AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆286Updated 7 months ago
- [ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model☆412Updated 6 months ago
- Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch☆361Updated 4 months ago
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆424Updated last year
- [ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation☆493Updated 7 months ago
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)☆512Updated last year
- ☆195Updated 3 months ago
- PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions.☆429Updated last year
- Official Implementation for "Cross-Image Attention for Zero-Shot Appearance Transfer"☆369Updated last year
- Code for Fast Training of Diffusion Models with Masked Transformers☆403Updated last year
- Code for instruction-tuning Stable Diffusion.☆232Updated last year
- PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.☆127Updated last year
- FMBoost: Boosting Latent Diffusion with Flow Matching (ECCV 2024 Oral)☆232Updated 6 months ago
- Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)☆566Updated last year
- Minimal implementation of scalable rectified flow transformers, based on SD3's approach☆564Updated 11 months ago
- Train VAE like a boss☆279Updated 7 months ago
- Official pytorch implementation of the paper: "An Edit Friendly DDPM Noise Space: Inversion and Manipulations". CVPR 2024.☆334Updated 10 months ago
- Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"☆406Updated last year
- LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusi…☆473Updated 8 months ago
- Official Implementation of "Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models"☆391Updated last year
- Video-P2P: Video Editing with Cross-attention Control☆411Updated 10 months ago
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!☆396Updated 3 months ago
- Official implementation of Inductive Moment Matching☆477Updated 2 months ago
- [ICLR2024] The official implementation of paper "VDT: General-purpose Video Diffusion Transformers via Mask Modeling", by Haoyu Lu, Guoxi…☆238Updated last year
- Official implementation of MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation (https://arxiv.org/abs/…☆344Updated 2 years ago
- [CVPR'23] Video Probabilistic Diffusion Models in Projected Latent Space☆317Updated last year
- Implementation of Key-Locked Rank One Editing, from Nvidia AI☆236Updated last year
- Educational repository for applying the main video data curation techniques presented in the Stable Video Diffusion paper.☆82Updated last year