lucidrains / lumiere-pytorch
Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch
☆268Updated 6 months ago
Alternatives and similar repositories for lumiere-pytorch:
Users that are interested in lumiere-pytorch are comparing it to the libraries listed below
- Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch☆305Updated last month
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)☆483Updated 8 months ago
- Minimal implementation of scalable rectified flow transformers, based on SD3's approach☆478Updated 7 months ago
- Officail Implementation for "Cross-Image Attention for Zero-Shot Appearance Transfer"☆351Updated 9 months ago
- [ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model☆401Updated 3 months ago
- ☆188Updated last week
- Unofficial PyTorch implementation of the VideoLDM.☆153Updated last year
- [ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation☆487Updated 3 months ago
- [ICLR 2024] Code for FreeNoise based on VideoCrafter☆398Updated 7 months ago
- PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions.☆412Updated 9 months ago
- Train VAE like a boss☆265Updated 3 months ago
- LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusi…☆450Updated 5 months ago
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer☆418Updated 4 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆232Updated 6 months ago
- Text to Image Latent Diffusion using a Transformer core☆161Updated 5 months ago
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆410Updated last year
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆271Updated 3 months ago
- PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.☆115Updated 10 months ago
- Official Pytorch Implementation of DenseDiffusion (ICCV 2023)☆488Updated last year
- LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation☆468Updated 3 months ago
- Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models arXiv 2023 / CVPR 2024☆330Updated 4 months ago
- Official pytorch implementation of the paper: "An Edit Friendly DDPM Noise Space: Inversion and Manipulations". CVPR 2024.☆314Updated 7 months ago
- Official Implementation of "Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models"☆382Updated last year
- Code for Fast Training of Diffusion Models with Masked Transformers☆388Updated 9 months ago
- Code release for Image Sculpting: Precise Object Editing with 3D Geometry Control [CVPR 2024]☆288Updated 11 months ago
- ☆83Updated last year
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!☆381Updated 5 months ago
- Video-P2P: Video Editing with Cross-attention Control☆394Updated 7 months ago
- Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"☆289Updated last month
- [NeurIPS 2024] Official implementation of "Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models"☆326Updated last month