lucidrains / lumiere-pytorch
Implementation of Lumiere, SOTA text-to-video generation from Google DeepMind, in Pytorch
☆271 Updated 8 months ago
Alternatives and similar repositories for lumiere-pytorch:
Users who are interested in lumiere-pytorch are comparing it to the libraries listed below:
- [ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model ☆411 Updated 5 months ago
- Train VAE like a boss ☆274 Updated 6 months ago
- FMBoost: Boosting Latent Diffusion with Flow Matching (ECCV 2024 Oral) ☆227 Updated 4 months ago
- ☆192 Updated 2 months ago
- Unofficial PyTorch implementation of the VideoLDM. ☆156 Updated last year
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope… ☆266 Updated last month
- Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch ☆342 Updated 3 months ago
- Minimal implementation of scalable rectified flow transformers, based on SD3's approach ☆516 Updated 9 months ago
- Huggingface-compatible SDXL Unet implementation that is readily hackable ☆416 Updated last year
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024) ☆499 Updated 10 months ago
- AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp… ☆282 Updated 5 months ago
- Educational repository for applying the main video data curation techniques presented in the Stable Video Diffusion paper. ☆82 Updated last year
- Simple large-scale training of stable diffusion with multi-node support. ☆131 Updated last year
- LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation ☆477 Updated 5 months ago
- PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models. ☆123 Updated last year
- Code for Fast Training of Diffusion Models with Masked Transformers ☆398 Updated 11 months ago
- LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusi… ☆467 Updated 7 months ago
- ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024) ☆242 Updated 9 months ago
- Code for instruction-tuning Stable Diffusion. ☆227 Updated last year
- [ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation ☆491 Updated 5 months ago
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models ☆274 Updated 4 months ago
- Text to Image Latent Diffusion using a Transformer core ☆178 Updated 7 months ago
- Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation" ☆171 Updated 10 months ago
- PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions. ☆425 Updated 11 months ago
- Subject-Diffusion: Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning ☆296 Updated 9 months ago
- Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models" ☆402 Updated last year
- Official Implementation of "Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models" ☆390 Updated last year
- Official Implementation for "Cross-Image Attention for Zero-Shot Appearance Transfer" ☆365 Updated 11 months ago
- Official implementation of "Controlling Text-to-Image Diffusion by Orthogonal Finetuning". ☆290 Updated 6 months ago
- Official Pytorch Implementation of DenseDiffusion (ICCV 2023) ☆492 Updated last year