lucidrains / lumiere-pytorchLinks
Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch
☆277Updated 11 months ago
Alternatives and similar repositories for lumiere-pytorch
Users that are interested in lumiere-pytorch are comparing it to the libraries listed below
Sorting:
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)☆520Updated last year
- [ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model☆416Updated 7 months ago
- Officail Implementation for "Cross-Image Attention for Zero-Shot Appearance Transfer"☆371Updated last year
- Unofficial PyTorch implementation of the VideoLDM.☆157Updated last year
- Official Pytorch Implementation of DenseDiffusion (ICCV 2023)☆496Updated last year
- ☆196Updated 4 months ago
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!☆402Updated 4 months ago
- LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation☆484Updated 7 months ago
- [SIGGRAPH Asia 2024] ReVersion: Diffusion-Based Relation Inversion from Images☆504Updated 6 months ago
- Code for instruction-tuning Stable Diffusion.☆235Updated last year
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆424Updated last year
- Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"☆407Updated last year
- Code for Fast Training of Diffusion Models with Masked Transformers☆403Updated last year
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆289Updated 7 months ago
- [ICLR 2024] Code for FreeNoise based on VideoCrafter☆411Updated 11 months ago
- Train VAE like a boss☆281Updated 8 months ago
- [ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation☆495Updated 7 months ago
- Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"☆173Updated last year
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆288Updated 3 months ago
- ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024)☆244Updated 11 months ago
- Video-P2P: Video Editing with Cross-attention Control☆414Updated 11 months ago
- Minimal implementation of scalable rectified flow transformers, based on SD3's approach☆589Updated 11 months ago
- Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch☆370Updated 5 months ago
- Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models arXiv 2023 / CVPR 2024☆341Updated 9 months ago
- Official implementation of "Controlling Text-to-Image Diffusion by Orthogonal Finetuning".☆292Updated 8 months ago
- Official implementation of Inductive Moment Matching☆488Updated 3 months ago
- Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV …☆282Updated last year
- PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions.☆430Updated last year
- [ICLR 2024 Spotlight] Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.☆511Updated last year
- Implementation of MagViT2 Tokenizer in Pytorch☆610Updated 5 months ago