lucidrains / lumiere-pytorch
Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch
☆250Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for lumiere-pytorch
- [ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model☆389Updated last week
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)☆445Updated 5 months ago
- [ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation☆459Updated 3 weeks ago
- Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch☆255Updated 2 months ago
- Unofficial PyTorch implementation of the VideoLDM.☆149Updated last year
- Train VAE like a boss☆247Updated last month
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆213Updated 3 months ago
- Code for Fast Training of Diffusion Models with Masked Transformers☆375Updated 6 months ago
- Minimal implementation of scalable rectified flow transformers, based on SD3's approach☆444Updated 4 months ago
- Scaling Diffusion Transformers with Mixture of Experts☆207Updated 2 months ago
- Text to Image Latent Diffusion using a Transformer core☆145Updated 2 months ago
- LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusi…☆435Updated 2 months ago
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆242Updated 3 weeks ago
- [ICLR 2024] Code for FreeNoise based on VideoCrafter☆387Updated 4 months ago
- Code for instruction-tuning Stable Diffusion.☆212Updated 9 months ago
- MoVQGAN - model for the image encoding and reconstruction☆200Updated last year
- This repo contains the code for 1D tokenizer and generator☆554Updated this week
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆401Updated last year
- ☆78Updated 10 months ago
- Official pytorch implementation of the paper: "An Edit Friendly DDPM Noise Space: Inversion and Manipulations". CVPR 2024.☆284Updated 4 months ago
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer☆341Updated last month
- Officail Implementation for "Cross-Image Attention for Zero-Shot Appearance Transfer"☆337Updated 6 months ago
- FMBoost: Boosting Latent Diffusion with Flow Matching (ECCV 2024 Oral)☆193Updated last month
- LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation☆456Updated last week
- [NeurIPS 2024] Official implementation of "Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models"☆307Updated last month
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini…☆503Updated 3 months ago
- PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.☆98Updated 7 months ago
- Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models arXiv 2023 / CVPR 2024☆319Updated last month
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆245Updated 3 weeks ago
- Official Pytorch Implementation of DenseDiffusion (ICCV 2023)☆484Updated last year